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(57) Abstract 

Novel polypeptides possessing part or all of the primary 
structural conformation and one or more of the biological pro- 
perties of mammalian erythropoietin ('EPO') which are charac- 
terized in preferred forms by being the product of procaryotic 
or eucaryotic host expression of an exogenous DNA sequence. 
Illustratively, genomic DNA, cDNA and manufactured DNA 
sequences coding for part or all of the sequence of amino acid 
residues of EPO or for analogs thereof are incorporated into 
autonomously replicating plasmid or viral -vectors employed to. 
transform or transfect suitable procaryotic or eucaryotic host 
cells such as bacteria, yeast or vertebrate ceils in culture. Upon 
isolation from culture media or cellular lysates or fragments, 
products of expression of the DNA sequences display, e.g., the 
immunological properties and in vitro and in vivo biological ac- 
tivities of EPO of human or monkey species origins. Disclosed 
also are chemically synthesized polypeptides sharing the bio- 
chemical and immunological properties of EPO. Also disclosed 
are improved methods for the detection of specific single 
stranded polynucleotides in a heterologous cellular or viral 
sample prepared from, e.g., DNA present in a plasmid or viral- 
borne cDNA or genomic DNA 'library'. 
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This is a continuation-in-part of my co-pending 
U.S. Patent Application Serial Nos. 561,024, filed 
December 13, 1983, 582,185, filed February 21, 1984, and 
655,841, filed September 28, 1984. 



BACKGROUND 

The present invention relates generally to the 
manipulation of genetic materials and, more particularly, 
to recombinant procedures making possible the production 
of polypeptides possessing part or all of the primary 
structural conformation and/or one or more of the biolo- 
gical properties of naturally-occurring erythropoietin. 

A. Manipulation Of Gen atic Materials 

Genetic materials may be broadly defined as 
those chemical substances which program for and guide the 
manufacture of constituents of cells and viruses and 
direct the responses of cells and viruses. A long chain 
polymeric substance known as deoxyribonucleic acid (DMA) 
comprises the genetic material of all living cells and 
viruses except for certain viruses which are programmed 
25 by ribonucleic acids (RNA). The repeating units in ONA 
polymers are four different nucleotides, each of which 
consists of either a purine (adenine or guanine) or a 
pyrimidine (thymine or cytosine) bound to a deoxyribose 
sugar to which a phosphate group is attached. Attachment 
of nucleotides in linear polymeric form is by means of 
fusion of the 5' phosphate of one nucleotide to the 3 f 
hydroxyl group of another. Functional ONA occurs in the 
form of stable double stranded associations of single 
strands of nucleotides (known as deoxyoligonucleotides ) , 



30 
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which associations occur by means of hydrogen bonding 
between purine and pyrimidine bases [i.e., 
"complementary" associations existing either between ade- 
nine (A) and thymine CT) or guanine- CG) and cytosine 

5 (C)] . By convention, nucleotides are referred to by the 
names of their constituent purine or pyrimidine bases, 
and the complementary associations of nucleotides in 
double stranded DNA (i.e., A-T and G-C) are referred to. 
as "base pairs". Ribonucleic acid is a polynucleotide 

10 comprising adenine, guanine, cytosine and uracil (U), 
rather than thymine, bound to ribose and a phosphate 
group. 

Most briefly put, the programming function of 
DNA is generally effected through a process wherein spe- 
15 cific DNA nucleotide sequences (genes) are "transcribed" 
into relatively unstable messenger RNA (mRNA ) polymers. 
The mRNA, in turn, serves as a template for the formation 
of structural, regulatory and catalytic proteins from 
amino acids. This mRNA "translation" process involves 
20 the operations of small RNA strands CtRNA) which. 

transport and align individual amino acids along the mRNA 
strand to allow for formation of polypeptides in proper 
amino acid sequences. The mRNA "message", derived from 
DNA and providing the basis for the tRNA supply and 

25 orientation of any given one of the twenty amino acids 
for polypeptide "expression", is in the form of triplet 
"codons" -- sequential groupings of three nucleotide 
bases. In one sense, the formation of a protein' is the 
ultimate form of "expression" of the programmed genetic 

30 message provided by the nucleotide sequence of a gene. 

"Promoter" DNA sequences usually "precede" a 
gene in a DNA polymer and provide a site for initiation 
of the transcription into mRNA. "Regulator" DNA sequen- 
ces, also usually "upstream" of (i.e., preceding) a gene 

35 in a given DNA polymer, bind proteins that determine the 
frequency Cor rate) of transcriptional initiation. 
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Collectively referred to as "promoter/regulator" or 
"control" DNA sequence, these sequences which precede a 
selected gene (or series of genes) in a functional DNA 
polymer cooperate to determine whether the transcription 
5 (and eventual expression) of a gene will occur. DNA 

sequences which "follow" a gene in a DNA polymer and pro- 
vide a signal for termination of the transcription into 
mRNA are referred to as transcription "terminator" 
sequences. 

10 a focus of microbiological processing for the 

last decade has been the attempt to manufacture 
industrially and pharmaceutical^ significant substances 
using organisms which either do not initially have gene- 
tically coded information concerning the desired product 

15 included in their DNA, or (in the case of mammalian cells 
in culture) do not ordinarily express a chromosomal gene 
at appreciable levels. Simply put, a gene that specifies 
the structure of a desired polypeptide product is either 
isolated from a "donor" organism or chemically synthe- 

20 sized and then stably introduced into another organism 
which is preferably a self-replicating unicellular orga- 
nism such as bacteria, yeast or mammalian cells in 
culture. Once this is done, the existing machinery for 
gene expression in the "transformed" or " transf ected" 

25 microbial host cells operates to construct the desired 
product, using the exogenous DNA as a template for 
transcription of mRNA which is then translated into a 
continuous sequence of amino acid residues. 

The art is rich in patent and literature publi-. 

30 cations relating to "recombinant DNA" methodologies for 
the isolation, synthesis, purification and amplification 
of genetic materials for use in the transformation of 
selected host organisms. U.S. Letters Patent 
No. 4,237,224 to Cohen, et al . , for example, relates to 

35 transformation of unicellular host organisms with 

"hybrid" viral or circular plasmid ONA which includes 
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selected exogenous DNA sequences. The procedures of the 
Cohen, et al. patent first involve manufacture of a 
transformation vector by enzymatically cleaving viral or 
circular plasmid DNA to form linear DNA strands. 
5 Selected foreign ("exogenous" or "heterologous") DNA 
strands usually including sequences coding for desired 
product are prepared in linear form through use of simi- 
lar enzymes. The linear viral or plasmid DNA is incu- 
bated with the foreign DNA in the presence of ligating 

10 enzymes capable of effecting a restoration process and 
"hybrid" vectors are formed which include the selected 
exogenous DNA segment "spliced" into the viral or cir- 
cular DNA plasmid. 

Transformation of compatible unicellular host 

15 organisms with the hybrid vector results in the formation 
of multiple copies of the exogenous DNA in the host cell 
population. In some instances, the desired result is 
simply the amplification of the foreign DNA and the 
"product" harvested is DNA. More frequently, the goal of 

20 transformation is the expression by the host cells of the 
exogenous DNA in the form of large scale synthesis of 
isolatable quantities of commercially significant protein 
or polypeptide fragments coded for by the foreign DNA. 
See also, e.g., U.S. Letters Patent Nos. 4,264,731 (to 

25 Shine), 4,273,875 (to Manis), 4,293,652 (to Cohen), and 
European Patent Application 093,619, published November 
9, 1983. 

The development of specific DNA sequences for 
splicing i-nto DNA vectors is accomplished by a variety of 

30 techniques, depending to a great deal on the degree of 
" foreignness" of the "donor" to the projected host and 
the size of the polypeptide to be expressed in the host. 
At the risk of over-simplification, it can be stated that 
three alternative principal methods can be employed; (1) 

35 the "isolation" of double-stranded DNA sequence from the 
genomic DNA of the donor; (2) the chemical manufacture of 
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a ONA sequence providing a code for a polypeptide of 
interest; and (3) the in vitro synthesis. of a double- 
stranded ONA sequence by enzymatic "reverse transcrip- 
tion" of mRNA isolated from donor cells. The 
5 last-mentioned methods which involve formation of a ONA 
••complement 1 ' of mRNA are generally referred to as "cONA" 
methods . 

Manufacture of DNA sequences is frequently the 
method of choice when the entire sequence of amino acid 

10 residues of the desired polypeptide product is known. 
DNA manufacturing procedures of co-owned, co-pending 
U.S. Patent Application Serial No. 483,451, by Alton, et 
al., (filed April 15, 1983 and corresponding to PCT 
US83/00605, published November 24, 1983 as W083/04053), 

15 for example, provide a superior means for accomplishing 
such highly desirable results as: providing for the pre- 
sence of alternate codons commonly found in genes which 
are highly expressed in the host organism selected for 
expression (e.g., providing yeast or E.coli "preference" 

20 codons); avoiding the presence of untranslated "intron" 
sequences (commonly present in mammalian genomic ONA 
sequences and mRNA transcripts thereof) which are not 
readily processed by procaryotic host cells; avoiding 
expression of undesired "leader" polypeptide sequences 

25 commonly coded for by genomic DNA and cONA sequenc.es but 
frequently not readily cleaved from the polypeptide of 
interest by bacterial or yeast host cells; providing for 
ready insertion of the DNA in convenient expression vec- 
tors in association with desired promoter/regulator and 

30 terminator sequences; and providing for ready construc- 
tion of genes coding for polypeptide fragments and ana- 
logs of the desired polypeptides. 

When the entire sequence of amino acid residues 
of the desired polypeptide is not known, direct manufac- 

35 ture of ONA sequences is not possible and isolation of 

DNA sequences coding for the polypeptide by a cONA method 
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becomes the method of choice despite the potential 
drawbacks in ease of assembly of expression vectors 
capable of providing high levels of microbial expression 
referred to above. Among the standard procedures for 
5 isolating cDNA sequences of interest is the preparation 
of plasmid-borne cONA "libraries" derived from reverse 
transcription of mRNA abundant in donor cells selected as 
responsible for high level expression of genes (e.g., 
libraries of cDNA derived from pituitary cells which 
10 express relatively large quantities of growth hormone 
products). Where substantial portions of the polypep- 
tide's amino acid sequence are known, labelled, single- 
stranded DNA probe sequences duplicating a sequence 
putatively present in the "target" cDNA may be employed 
15 in ONA/DNA hybridization procedures carried out on cloned 
copies of the cDNA which have been denatured to single 
stranded form. Csee, generally, the disclosure and 
discussions of the art provided in U.S. Patent No. 
4,394,443 to Weissman, et al. and the recent demonstra- 
20 tions of the use of long oligonucleotide hybridization 
probes reported in Wallace, et al., Nuc. Acids Res. , 6, 
pp. 3543-3557 (1979), and Reyes, et al . , P.N.A.S. 
(U.S.A.) , 79, pp. 3270-3274 (1982), and Jaye, et al., 
Nuc. Acids Res. , II, pp. 2325-2335 (1983). See also, U.S. 
25 Patent No. 4,358,535 to Falkow, et al., relating to 

DNA/DNA hybridization procedures in effecting diagnosis; 
published European Patent Application Nos. 0070685 and 
0070687 relating to light-emitting labels on single 
stranded polynucleotide probes; Davis, et al-. , ,f A Manual 
30 for Genetic Engineering, Advanced Bacterial Genetics", 
Cold Spring Harbor Laboratory, Cold Spring Harbor, N.Y. 
(1980) at pp. 55-58 and 174-176, relating to colony and 
plaque hybridization techniques; and, New England Nuclear 
(Boston, Mass.) brochures for "Gene Screen 11 Hybridization 
35 Transfer Membrane materials providing instruction manuals 
for the transfer and hybridization of DNA and RNA, 
Catalog No. NEF-972.3 
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Among the more signficant recent advances in 
hybridization procedures " for the screening of recombinant 
clones is the use of labelled mixed synthetic oligo- 
nucleotide probes, each of which is potentially the 
5 complete complement of a specific DNA sequence in the 
hybridization sample including a heterogenous mixture of 
single stranded DNAs or RNAs. These procedures are 
acknowledged to be especially useful in the detection of 
cDNA clones derived from sources which provide extremely 

10 low amounts of mRNA sequences for the polypeptide of 
interest. Briefly put, use of stringent hybridization 
conditions directed toward avoidance of non-specific 
binding can allow, e.g., for the autoradiographic 
visualization of a specific cDNA clone upon the event of 

15 hybridization of the target ONA to that single probe 

within the mixture which is its complete complement. See 
generally, Wallace, et al., Nuc. Acids Res. , 9, pp. 
879-897 (1981); Suggs, et al . P.N.A.S. (U.S.A.) , 78, pp. 
6613-6617 (1981); Choo, et al., Nature, 299 , pp. 178-180 

20 (1982); Karachi, et al., P.N.A.S. (U.S.A. ) , 79, 

pp. 6461-6464 (1982); Ohkubo, et al., P.N.A.S. (U.S.A.) , 
80,. pp. 2196-2200 (1983); and Kornblihtt, et al. 
P.N.A.S. (U.S.A.) , 80, pp. 3218-3222 (1983). In general, 
the mixed probe procedures of Wallace, et al. (1981), 

2 5 supra , ' have been expanded upon by various workers to the 
point where reliable results have reportedly been 
obtained in a cDNA clone isolation using a 32 member 
mixed "pool" of 16-base-ldrig (16-mer) oligonucleotide 
_ probes of uniformly, varying DNA sequences together with 

30 a single 11-mer to effect a two-site "positive" confir- 
mation of the presence of cDNA of interest. See, 
Singer-Sam, et al., P.N.A.S. (U.S.A. ) , 80 , pp. 802-806 
(1983) . 

The use of genomic ONA isolates is the least 
35 common of the three above-noted methods for developing 
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specific DNA sequences for use in recombinant procedures. 
This is especially true in the area of recombinant proce- 
dures directed to securing microbial expression of mam- 
malian polypeptides and is due, principally to the 
5 complexity of mammalian genomic DNA. Thus, while 
reliable procedures exist for developing phage-borne 
libraries of genomic DNA of human and other mammalian 
species origins [See, e.g., Lawn, et al. Cell, 15, 
pp. 1157-1174 (1978) relating to procedures for 
10 generating a human genomic library commonly referred to 
as the "Maniatis Library** t Karn, et al., P.N.A.S. 
(U.S. A. 3 , 77, pp. 5172-5176 (1980) relating to a human 
genomic library based on alternative restriction endo- 
nuclease fragmentation procedure? and Blattner, et al., 
15 Science. 196, pp. 161-169 (1977) describing construction 
of a bovine genomic library] there have been relatively 
few successful attempts at use of hybridization proce- 
dures in isolating genomic DNA in the absence of exten- 
sive foreknowledge of amino acid or DNA sequences. As 
20 one example, Fiddes, et al., J.Mol. and Aop.Genetics, 1, 
pp. 3-18 (1981) report the successful isolation of a gene 
coding for the alpha subunit of the human pituitary gly- 
coprotein hormones from the Maniatis Library through use 
of a "full length" probe including a complete 621 base 
'25 pair- fragment of a previously-isolated cONA sequence for 
the alpha subunit. As another example, Das, et al., 
P.N.A.S. (U.S.A.) , 80, pp. 1531-1535 (1983) report isola- 
tion of human genomic clones for- human HLA-DR using a 175 
base pair synthetic oligonucleotide. Finally, Anderson, 
30 et al., P.N.A.S. (U.S.A.), 80, pp. 6838-6842 (1983) 
report the isolation of genomic clone for bovine 
pancreatic trypsin inhibitor (BPTI) using a single probe 
86 base pairs in length and constructed according to the 
known amino acid sequence of BPTI. The authors note a 
35 determination- of poor prospects for isolating rnRNA 

suitable for synthesis of a cDNA library due to apparent 
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low levels of mRNA in initially targeted parotid gland 
and lung tissue sources and then address the prospects of 
success in probing a genomic library using a mixture of 
labelled probes, stating: "More generally, mixed- 
5 sequence oligodeoxynucleotide probes have been used to 
isolate protein genes of unknown sequence from cDNA 
libraries. Such probes are typically mixtures of 8-32 
oligonucleotides, 14-17 nucleotides in length, repre- 
senting every possible codon combination for a small 

10 stretch (5-6 residues) of amino acid sequence. Under 
stringent hybridization conditions that discriminate 
against incorrectly base-paired probes, these mixtures 
are capable of locating specific gene sequences in clone 
libraries of low-to-moderate complexity. Nevertheless, 

15 because of their short length and heterogeneity, mixed 
probes often lack the specificity required for probing 
sequences as complex as a mammalian genome. This makes 
such a method impractical for the isolation of mammalian 
protein genes when the corresponding mRNAs are 

20 unavailable." (Citations omitted). 

There thus continues to exist a need in the art 
for improved methods for effecting the rapid and effi- 
cient isolation of cDNA clones in instances where little 
is known of the amino acid sequence of the polypeptide 

25 coded for and where "enriched" tissue sources of mRNA are 
not readily available for use in constructing cDNA 
libraries. Such improved methods would be especially 
use'ful if they were -applicable to isolating mammalian 
genomic clones where sparse information is available con- 

30 cerning amino acid sequences of the polypeptide coded for 
by the gene sought. 

B. Erythropoietin As A Polypeptide Of Interest 

Erythropoiesis , the production of red blood 
35 cells, occurs continuously throughout the human life span 
to offset cell destruction. Erythropoiesis is a very • 
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precisely controlled physiological mechanism enabling 
sufficient numbers of red blood cells to be available in 
the blood for proper tissue oxygenation, but not so many 
that the cells would impede circulation. The formation 
5 of red blood cells occurs in the bone marrow and is under 
the control of the hormone, erythropoietin. 

Erythropoietin, an acidic glycoprotein of 
approximately 34,000 dalton molecular weight, may occur 
in three forms: a, 3 and asialo. The a and B forms 

10 differ slightly in carbohydrate components, but have the 
same potency, biological activity and molecular weight. 
The asialo form is an a or 8 form with the terminal car- 
bohydrate (sialic acid) removed. Erythropoietin is pre- 
sent in very low concentrations in plasma when the body 

15 is in a healthy state wherein tissues receive sufficient 
oxygenation from the existing number of erythrocytes. 
This normal low concentration is enough to stimulate 
replacement of red blood cells which are lost normally 
through aging. 

20 The amount of erythropoietin in the circulation 

is increased under conditions of hypoxia when oxygen 
transport by blood cells in the circulation is reduced. 
Hypoxia may be caused by loss of large amounts of blood 
through hemorrhage, destruction of red blood cells by 

25 over-exposure to radiation, reduction in oxygen intake 
due to high altitudes or prolonged unconsciousness, or 
various forms of anemia. In response to tissues 
undergoing hypoxic stress, erythropoietin will increase 
red blood cell production by stimulating the conversion 

30 of primitive precursor cells in the bone marrow into pro- 
erythroblasts which subsequently mature, synthesize 
hemoglobin and are released into the circulation as red 
blood cells. When the number of red blood cells in cir- 
culation is greater than needed for normal tissue oxygen 

35 requirements, erythropoietin in circulation is decreased 
See generally, Testa, et al.-. Exp.Hematol . , 
8C5upp. 8), 144-152 C1980); Tong, et al., J.Biol .Chem . , 
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256C24J, 12666-12672 (1981); Goldwasser, J .Cell .Physiol . , 
110(Supp. 1), 133-135 (1982); Finch, Blood , 60(6) , 
1241-1246 (1982); Sytowski, et al . , Expt .Hematol . , 8(Supp 
8_), 52-64 (1980: Naughton, Ann .Clin .Lab .Sci. , 13(5) , 
5 432-438 (1983); Weiss, et al., Am.J.Vet.Res. , 

44(10) ,1832-1835 (1983); Lappin, et al., Exp. Hematol . t 
11(7) , 661-666 (1983); Baciu, et al . , Ann.N .Y.A cad.Sci. , 
414, 66-72 (1983); Murphy, et al . , Acta .Haem atologica 
Japonica , 46(7) , 1380-1396 (1983); Oessypris, et al . , 

10 Brit. J.Haematol. , 56, 295-306 (1984); and, Emmanouel, et 
al., Am. J. Physiol. , 247 (1 Pt 2) , F168-76 (1984). 

Because erythropoietin is essential in the pro- 
cess of red blood cell formation, the hormone has poten- 
tial useful application in both the diagnosis and the 

15 treatment of blood disorders characterized by low or 
defective red blood cell production. See, generally, 
Pennathur-Das, et al., Blood , 63(5) , 1168-71 (1984) and 
Haddy, Am. Jour. Ped. Hematol. /Oncol. , 4, 191-196, (1982) 
relating to erythropoietin in possible therapies for 

20 sickle cell disease, and Eschbach, et al. J .Clin . Invest . , 
74(2) , pp. 434-441, (1984), describing a therapeutic 
regimen for uremic sheep based on in vivo response to 
erythropoietin-rich plasma infusions and proposing a 
dosage of 10 U EPO/kg per day for 15-40 days as correc- 

25 tive of anemia of the type associated with chronic renal 
failure. See also, Krane, Henry Ford Hosp.Med. J . , 31(3), 
177-181 (1983). 

It has recently been estimated that the availa- 
bility of erythropoietin in quantity would allow for . 

30 treatment each year of anemias of 1,600,000 persons in 
the United States alone. See, e.g. ,■ Morrison , 
"Bioprocessing in Space — an Overview", pp. 557-571 in 
The World Biotech Report 1984, Volume 2:USA, (Online 
Publications, New York, N.Y. 1984). Recent studies have 
. 35 provided a basis for projection of efficacy of erythro- 
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poietin therapy in a variety of disease states, disorders 
and states of hematologic irregularity: Vedovato, et 
al., Acta. Haematol , 71, 211-213 (1984) 
(beta-thalassemia); Vichinsky, et al., J.Pediatr. , 
5 105(1) t 15-21 (1984) (cystic fibrosis); Cotes, et al., 
Brit.J.Obstet.Gvneacol. , 90(4) , 304-311 (1983) 
(pregnancy, menstrual disorders); Haga, et al., 
Acta.Pediatr.Scand. , 72, 827-831 (1983) (early anemia of 
prematurity); Claus-Walker , et al., 
10 Arch. Phvs. Med. Rehabll." , 65, 370-374 (1984) (spinal cord 
injury); Dunn, et al., Eur .J. App l. Physiol., 52, 178-182 
(1984) (space flight); Miller, et al., Brit . J .Haematol . , 
52, 545-590 (1982) (acute blood loss); Udupa, et al., 
J. Lab. CI in. Med. , 103(4) , 574-580 and 581-588 (1984); and 
15 Lipschitz, et al., Blood , 63(3) , 502-509 (1983) (aging); 
and Dainiak, et al., Cancer , 51(6) , 1101-1106 (1983) and 
Schwartz, et al., Otolaryngol. , 109, 269-272 (1983) 
(various neoplastic disease states accompanied by abnor- 
mal erythropoiesis ) . 
20 Prior attempts to obtain erythropoietin in good 

yield from plasma or urine have proven relatively unsuc- 
cessful. Complicated and sophisticated laboratory tech- 
niques are necessary and generally result in the 
collection of very small amounts of impure and unstable 
25 extracts containing erythropoietin. 

U.S. Letters Patent No. 3,033,753 describes a 
method for partially purifying erythropoietin from sheep 
. blood' plasma which provides low yields of a crude solid 
extract containing erythropoietin. • 
30 Initial attempts to isolate erythropoietin from 

urine yielded unstable, biologically inactive prepara- 
tions of the hormone. U.S. Letters Patent No. 3,865,801 
describes a method of stabilizing the biological activity 
of a crude substance containing erythropoietin recovered 
35 from urine. The resulting crude preparation containing 
erythropoietin purportedly .retains 90% of erythropoietin 
activity, and is stable. 
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Another method of purifying human erythropoietin 
from urine of patients with aplastic anemia is described 
in Miyake, et al., J . Biol .Chem . , Vol. 252, No. 15 (August 
10, 1977), pp. 5558-5564. This seven-step procedure 
5 includes ion exchange chromatography, ethanol precipita- 
tion, gel filtration, and adsorption chromatography, and 
yields a pure erythropoietin preparation with a potency 
of 70,400 units/mg of protein in 2135 yield. 

U.S. Letters Patent No. 4,397,840 to Takezawa, 

10 et al. describes methods for preparing "an erythropoietin 
product" from healthy human urine specimens with weakly 
basic ion exchangers and proposes that the low molecular 
weight products obtained "have no inhibitory effects 
against erythropoietin. 

15 U.K. Patent Application No. 2,085,887 by 

Sugimoto, et al . , published May 6, 1982, describes a pro- 
cess for the production of hybrid human lymphoblastoid 
cells, reporting production levels ranging from 3 to 420 
Units of erythropoietin per ml of suspension of cells 

■20 (distributed into the cultures after mammalian host propaga 
tion containing up to 10 7 cells per ml. At the highest pro 
duction levels asserted to have been obtained, the rate 
of erythropoietin production could be calculated to be 
from 40 to about 4,000 Units/10 6 cells/48 hours in in 

25 vitro culture following transfer of cells from in vivo 
propagation systems. (See also the equivalent U.S. 
Letters Patent No. 4,377,513.) Numerous proposals have 
been made for isolation of erythropoietin from tissue 
. sources, including neoplastic cells, but the yields have 

30 been quite low. See, e.g., Jelkman, et al . , 

Exot.Hematol. , 11(7) , 581-588 (1983); Tambourin, et al . , 
P.N.A.S. (U.S.A.) , 80, 6269-6273 (1983); Katsuoka, et 
al., Gann, 74, 534-541 (1983); Hagiwara, et al . , Blood, 
63(4) , 828-835 (1984); and Choppin, et al., Blood ,, 64(2) , 

35 341-347 (1984). 

Other isolation techniques utilized to obtain 
purified erythropoietin involve immunological procedures. 
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A polyclonal, serum-derived antibody directed against 
erythropoietin is developed by injecting an animal, pre- 
ferably a rat or rabbit, with human erythropoietin. The 
injected human erythropoietin is recognized as a foreign 
5 antigenic substance by the immune system of the animal 
and elicits production of antibodies against the antigen. 
Differing cells responding to stimulation by the antige- 
nic substance produce and release into circulation anti- 
bodies slightly different from those produced by other 
10 responding cells. The anti-body activity remains in the 
serum of the animal when its blood is extracted. While 
unpurified serum or antibody preparations purified as a 
serum immunoglobulin G fraction may then be used in 
assays to detect and complex with human erythropoietin, 
15 the materials suffer from a major disadvantage. This 

serum antibody, composed of all the different antibodies 
produced by individual cells, is polyclonal in nature and 
will complex with components in crude extracts other than 
erythropoietin alone . 
20 Of interest to the background of the present 

invention are recent advances in the art of developing 
continuous cultures of cells capable of producing a 
single species of antibody which is specifically immuno- 
logically reactive with a single antigenic determinant of 
25 a selected antigen. See, generally, Chisholm, High 

Technology , Vol. 3, No. 1, 57-63 C1983). Attempts have 
been made to employ cell fusion and hybridization tech- 
niques to develop "monoclonal" antibodies to erythro- - 
poietin and to employ these * antibodies in the isolation 
30 and quantitative detection of human erythropoietin. As 
one example, a report of the successful development of 
mouse-mouse hybridoma cell lines secreting monoclonal 
antibodies to human erythropoietin appeared in abstract 
form in Lee-Huang, Abstract No. 1463 of Fed.Proc . , 41, 
35 520 (1982). As another example, a detailed description 



WO 85/02610 PCT/US84/02021 

- 15 - 

of the preparation and use of a monoclonal, anti- 
erythropoietin antibody appears in Weiss, et al., 
P.N.A.S. (U.S.A.) , 79, 5465-5469 (1982). See also, 
Sasaki, Biomed.Biochim.Acta. , 42(11/12) , S202-S206 
5 (1983); Yanagawa, et al., Blood , 64(2) , 357-364 (1984); 
Yanagawa, et al . , 3 . Biol .Chem . , 259(5) , 2707-2710 (1984); 
and U.S. Letters Patent No. 4,465,624. 

Also of interest to the background of the inven- 
tion are reports of the immunological activity of synthe- 

10 tic peptides which substantially duplicate the amino acid 
sequence extant in naturally-occurring proteins, 
glycoproteins and nucleoproteins . More specifically, 
relatively low molecular weight polypeptides have been 
shown to participate in immune reactions which are simi- 

15 lar in duration and extent to the immune reactions of 

physiologically significant proteins such as viral anti- 
gens, polypeptide hormones, and the like. Included among 
the immune reactions of such polypeptides is the provoca- 
tion of the formation of specific antibodies in 

20 immunologically active animals. See, e.g., Lerner, et 

al., Cell , 23, 309-310 (1981); Ross, et al . , Nature , 294 , 
654-656 (1981); Walter, et al., P.N.A.S. (U.S.A.) , 77, 
5197-5200 (1980); Lerner, et al., P.N.A.S. (U.S.A. ) , 78, 
3403-3407 (1981); Walter, et al., P.N.A.S. (U.S.A. ) , 78 , 

25 4882-4886 (1981); Wong, et al . , P.N.A.S. (U.S.A.) , 78, 

7412-7416 (1981); Green, et al. Cell , 28, 477-487 (1982); 
Nigg, et al., P.N.A.S. (U.S.A.) , 79, 5322-5326 (1982); 
Baron, et 'al., Cell , 28 , 395-404 (1.982); Dreesman, et 
al., Nature , 29 5 , 158-160 (1982); and Lerner, Scientific 

30 American , 248 , No. 2, 66-74 (1983). See, also, Kaiser, 
et al., Science , 223 , pp. 249-255 (1984) relating to 
biological and immunological activities of synthetic pep- 
tides which approximately share secondary structures of 
peptide hormones but may not share their primary struc- 

35 tural conformation. The above studies relate, of course, 
to amino acid sequences of proteins other than erythro- 
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poietin, a substance for which no substantial, amino acid 
sequence information has been published. In co-owned, 
co-pending U.S. Patent Application Serial No. 463,724, 
filed February 4, 1983, by J. Egrie, published August 22, 
5 1984 as European Patent Application No. 0 116 446, there 
is described a mouse-mouse hybridoma cell line 
(A.T.C.C. No. HB8209) which produces a highly specific 
monoclonal, anti-erythropoietin antibody which is also 
specifically immunoreactive with a polypeptide comprising 
10 the following sequence of amino acids: 

NH 2 -Ala-Pro-Pro-Arg-Leu-Ile-Cys-Asp-Ser-Arg-:Val-Leu- 

Glu-Arg-Tyr-Leu-Leu-Glu-Ala-Lys-COOH. 

The polypeptide sequence is one assigned to the first 

twenty amino acid residues of mature human erythropoietin 

15 isolated according to the method of Miyake, et al . , 

j'.Biol.Chem. , 252, 5558-5564 (1977) and upon which amino 
acid analysis was performed by the gas phase sequencer 
(Applied Biosystems, Inc.) according to the procedure of 
Hewick, M., et al., J.Biol.Chem. , 256, 7990-7997 (1981). 

20 See, also, Sue, et al., Proc. Nat. Acad. Sci. (USA),, 80, 
pp. 3651-3655 (1983) relating to development of polyclo- 
nal antibodies against a synthetic 26-mer based on a dif- 
fering amino acid sequence, and Sytowski, et al., 
J.Immunol. Methods , 69, pp. 181-186 (1984). 

25 While polyclonal and monoclonal antibodies as 

described above provide highly useful materials for use 
in immunoassays, for detection and quantification of 
erythropoietin and" can be useful in the affinity purifi^ 
cation of erythropoietin , .it appears unlikely that" these 

30 materials can readily provide for the large scale isola- 
tion of quantities of erythropoietin from mammalian sour- 
ces sufficient for further analysis, clinical testing and 
potential wide-ranging therapeutic use of the substance 
in treatment of, e.g., chronic kidney disease wherein 

35 diseased tissues fail to sustain production of erythro- 
poietin. It is consequently projected in the art that 
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the best prospects for fully characterizing mammalian 
erythropoietin and providing large quantities of it for 
potential diagnostic and clinical use involve successful 
application of recombinant procedures to effect large 
5 scale microbial synthesis of the compound. 

While substantial efforts appear to have been 
made in attempted isolation of DNA sequences coding for 
human and other mammalian species erythropoietin, none 
appear to have been successful. This is due principally 

10 to the scarcity of tissue sources, especially human 

tissue sources, enriched in mRNA such as would allow for 
construction of a cDNA library from which a DNA sequence 
coding for erythropoietin might be isolated by conven- 
tional techniques. Further, so little is known of the 

15 continuous sequence of amino acid residues of erythro- 
poietin that it is not possible to construct, e.g., long 
polynucleotide probes readily capable of reliable use in 
DNA /DNA hybridization screening of cDNA and especially 
genomic DNA libraries. Illustratively, the twenty amino 

20 acid sequence employed to generate the above-named 

monoclonal antibody produced by A.T.C.C. No. HB8209 does 
not admit to the construction of an unambiguous, 60 base 
oligonucleotide probe in the manner described by 
Anderson, et al., supra . It is estimated that the human 

25 gene for erythropoietin may appear as a "single copy 
gene" within the human genome and, in any event, the 
genetic material coding for human erythropoietin is 
likely to constitute less than 0. 000053$ of total human 
genomic DNA which would be present in a genomic library. 

30 To date, the most successful of known reported 

attempts at recombinant-related methods to provide DNA 
sequences suitable for use in microbial expression of 
isolatable quantities of mammalian erythropoietin have 
fallen far short of the goal. As an example, Farber, et 

35 al. Exp.Hematol. , 11. Supp. 14, Abstract 101 (1983) 
report the extraction of mRNA from kidney tissues of 
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phenylhydrazine-treated baboons and the injection of the 
mRNA into Xenopus laevis oocytes with the rather tran- 
sitory result of i£ vitro production of a mixture of 
w translation products" which included among them 
5 displaying biological properties of erythropoietin. More 
recently, Farber, et al., Blood , 62 , No. 5, Supp. No. 1, 
Abstract 392, at page 122a (1983) reported the in vitro 
translation of human kidney mRNA by frog oocytes. The 
resultant translation product mixture was estimated to 

10 include on the order of 220 mU of a translation product 
having the activity of erythropoietin per microgram of 
injected mRNA. While such levels of in vitro translation 
of exogenous mRNA coding for erythropoietin were 
acknowledged to be quite low (compared even to the prior 

15 reported levels of baboon mRNA translation into the 

sought-for product) it was held that the results confirm 
the human kidney as a site of erythropoietin expression, 
allowing for the construction of an enriched human kidney 
cDNA library from which the desired gene might be iso- 

20 lated. [See also, Farber, Clin. Res. , 31(4) , 769A 
(1983).] 

Since the filing of U.S. Patent Application 
Serial Nos. 561,024 and 582,185, there has appeared a 
single report of the cloning and expression of what is 

25 asserted to have been human erythropoietin cDNA in 
E.coli . Briefly put, a number of cDNA clones were 
inserted into E.coli plasmids and 8-lactamase fusion pro- 
ducts were noted to be immunoreactive with a monoclonal 
antibody, to an unspecified *' epitope" of human erythro- 

30 poietin. See, Lee-Huang, Proc. Nat. Acad* Sci. (USA) , 
81, pp. 2708-2712 (1984). 

BRIEF SUMMARY 



35 The present invention provides, for the first 

time, novel purified and isolated polypeptide products 
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having part or all of the primary structural conformation 
(i.e., continuous sequence of amino acid residues) and 
one or more of the biological properties (e.g., immunolo- 
gical properties and in vivo and in vitro biological 
5 activity) of naturally-occurring erythropoietin, 

including allelic variants thereof. These polypeptides 
are also uniquely characterized by being the product of 
procaryotic or eucaryotic host expression- (e.g. , by bac- 
terial, yeast and mammalian cells in culture) of exoge- 

10 nous DNA sequences obtained by genomic or cDNA cloning or 
by gene synthesis. Products of microbial expression in 
vertebrate (e;g., mammalian and avian) cells may be 
further characterized by freedom from association with 
human proteins or other contaminants which may be asso- 

15 ciated with erythropoietin in its natural mammalian 

cellular environment or in extracellular fluids such as 
plasma or urine. The products of typical yeast (e.g., 
Saccaromyces cerevisiae ) or procaryote (e.g., E^coli) 
host cells are free of association with any mammalian 

20 proteins. Depending upon the host employed , polypeptides 
of the invention may be glycosylated with mammalian or 
other eucaryotic carbohydrates or may be non- 
glycosylated. Polypeptides of the invention may also 
include an initial methionine amino acid residue (at 

25 position -1 ) . 

Novel glycoprotein products of the invention 
include those having a primary structural conformation 
sufficiently duplicative of that of a naturally-occurring 
(e.g., human) erythropoietin to allow possession of one 

30 or more of the biological properties thereof and having 
an average carbohydrate composition which differs from 
that of naturally-occurring (e.g., human) erythropoietin. 

Vertebrate (e.g., COS-1 and CHO) cells provided 
by the present invention comprise the first cells ever 

35 available which can be propagated in vitro continuously 
and which upon growth in culture are capable of producing 
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in the medium of their growth in excess of 100U 
[preferably in excess of 500U and most preferably in 
excess of 1,000 to 5,000U) of erythropoietin per 
10 6 cells in 48 hours as determined by radioimmunoassay. 
5 Also provided by the present invention are 

synthetic polypeptides wholly or partially duplicative of 
continuous sequences of erythropoietin amino acid resi- 
dues which are herein for the first time elucidated. 
These sequences, by virtue of sharing primary, secondary 
10 or tertiary structural and conformational characteristics 
with naturally-occurring erythropoietin may possess 
biological activity and/or immunological properties in 
common with the naturally-occurring product such that 
they may be employed as biologically active or immunolo- 
15 gical substitutes for erythropoietin in therapeutic and 
immunological processes. Correspondingly provided are 
monoclonal and polyclonal antibodies generated by stan- 
dard means which are immunoreactive with such polypep- 
tides and, preferably, also immunoreactive with 
20 naturally-occurring erythropoietin. 

Illustrating the present invention are cloned 
DNA sequences of monkey and human species origins and 
polypeptide sequences suitably deduced therefrom which\ 
represent, respectively, the primary structural confor- 
25 mation of erythropoietins of monkey and human species 
origins. 

Also provided by the present invention are novel 
biologically functional viral and circular plasmid DNA 
vectors incorporating DNA sequences of the invention and . 

30 microbial (e.g., bacterial, yeast and mammalian cell) 
host organisms stably transformed or transfected with 
such vectors. Correspondingly provided by the invention 
are .novel methods for the production of useful polypep- 
tides comprising cultured growth of such transformed or 

35 transfected microbial hosts under conditions facilitative 
of large. scale expression of the exogenous, vector-borne 
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DNA sequences and isolation of the desired polypeptides 
from the growth medium, cellular lysates or cellular 

membrane fractions. 

Isolation and purification of microbially 
5 expressed polypeptides provided by the invention may be 
by conventional means including, e.g., preparative chro- 
matographic separations and immunological separations 
involving monoclonal and/or polyclonal antibody prepara- 
tions. 

10 Having herein elucidated the sequence of amino 

acid residues of erythropoietin, the present invention 
• provides for the total and/or partial manfucture of .DNA 
sequences coding for erythropoietin and including such 
advantageous characteristics as incorporation of co-dons 

15 •■preferred" for expression by selected non-mammalian 
hosts, provision of sites for cleavage by restriction 
endonuclease enzymes and provision of additional initial, 
terminal or intermediate DNA sequences which facilitate 
construction of readily expressed vectors. Corres- 

20 pondingly, the present invention provides for manufacture 
(and development by site specific mutagenesis of cDNA and 
genomic DNA ) of DNA sequences coding for microbial 
expression of polypeptide analogs or derivatives of 
erythropoietin which differ from naturally-occurring 

25 forms in terms of the identity or location of one or more 
amino acid residues (i.e., deletion analogs containing 
less than all of the residues specified for EPO and/or 
substitution analogs wherein one or more residues, spe- 
cified are replaced by other residues and/or addition 

30 analogs wherein one or more amino acid residues is added 
to a terminal or medial portion of the polypeptide); and 
which share some or all the properties of naturally- 
occurring forms. 

Novel DNA sequences of the invention include all 
35 sequences useful in securing expression in procaryotic or 
eucaryotic host cells of polypeptide products having at 
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least a part of the primary structural conformation and 
one or more of the biological properties of erythro- 
poietin which are comprehended by: (a) the DNA sequences 
set out in Tables V and VI herein or their complementary 
5 strands; (b) DNA sequences which hybridize (under hybri- 
dization conditions such as illustrated herein or more 
stringent conditions) to DNA sequences defined in (a) or 
fragments thereof; and (c) DNA sequences which, but for 
the degeneracy of the genetic code, would hybridize to 

10 DNA sequences defined in (a) and (b) above. Specifically 
comprehended in part (b) are genomic DNA sequences 
encoding allelic variant forms of monkey -and human 
erythropoietin and/or encoding other mammalian species of 
erythropoietin. Specifically comprehended by part (c) 

15 are manufactured DNA sequences encoding EPO, EPO 

fragments and EPO analogs which DNA sequences may incor- 
porate codons facilitating translation of messenger RNA 
in non-vertebrate hosts. 

Comprehended by the present invention is that 

20 class of polypeptides coded for by portions of the DNA 
complement "to the top strand human genomic DNA sequence 
of Table VI herein, i.e., "complementary inverted pro- 
teins 1 * as described by Tramontano, et al., Nucleic Acids 
Research , 12 , pp. 5049-5059 (1984). 

25 Also comprehended by the invention are phar- 

maceutical compositions comprising effective amounts of 
polypeptide products of the invention together with 
suitable diluents, adjuvants and/or carriers which allow 
for provision of erythropoietin therapy, especially, in 

30 the treatment of anemic disease states and most espe- 
cially such anemic states as attend chronic renal 
failure . 

Polypeptide products of the invention may be 
"labelled" by covalent association with a detectable 
35 marker substance (e.g., radiolabeled with * I) to pro- 
vide reagents useful in detection and quantification of 
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erythropoietin in solid tissue and fluid samples such as 
blood. or urine. DNA products of the invention may also 
be labelled with detectable markers (such as radiolabels 
and non-isotopic labels such as biotin) and employed in 
5 DNA hybridization processes to locate the erythropoietin 
gene position and/or the position of any related gene 
family in the human, monkey and other mammalian species 
chromosomal map. They can also be used for identifying 
the erythropoietin gene disorders at the DNA level and 

10 used as gene markers for identifying neighboring genes 
and their disorders. 

' As hereinafter described in detail, the present 
invention further provides significant improvements in 
methods for detection of a specific single stranded poly- 

15 nucleotide of unknown sequence in a heterogeneous cellu- 
lar or viral sample including multiple single-stranded 
polynucleotides where 

(a) a mixture of labelled single-stranded poly- 
nucleotide probes is prepared having uniformly varying 

20 sequences of bases, each of said probes being potentially 
specifically complementary to a sequence of bases which 
is putatively unique to the polynucleotide to be 
detected , 

(b) the sample is fixed to a solid substrate, 
25 (c) the substrate having the sample fixed 

thereto is treated to diminish further binding of poly- 
nucleotides thereto except by way of hybridization to 
polynucleotides in said sample, 

_(d) the tr.eated substrate having- -tfcue sampJLe 

30 " fixed thereto is transitorily contacted with said mixture 
of labelled probes under conditions facilitative of 
hybridization only between totally complementary poly- 
nucleotides, and, 

(e) the specific polynucleotide is detected by 

35 monitoring for the presence of a hybridization reaction 
between it and a totally complementary probe within said 
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mixture of labelled probes, as evidenced by the presence 
of a higher density of labelled material. on the substrate 
at the locus of the specific polynucleotide in comparison 
to a background density of labelled material resulting 
5 from non-specific binding of labelled probes to the 
substrate . 

- The procedures are especially effective in 
situations dictating use of 64, 128, 256, 512, 1024 or 
more mixed polynucleotide probes having a length of 17 to 

10 20 bases in DNA/DNA or RNA/RNA or DNA/RNA hybridizations* 
As described infra , the above-noted improved 
procedures have illustratively allowed for the iden- 
tification of cDNA clones coding for erythropoietin of 
monkey species origins within a library prepared from 

15 anemic monkey kidney cell mRNA. More specifically, a 
mixture of 128 uniformly varying 20-mer probes based on 
amino acid sequence information derived from sequencing 
fractions of human erythropoietin was employed in colony 
hybridization procedures to identify seven "positive" 

20 erythropoietin cDNA clones within a total of 200,000 

colonies. Even more remarkably, practice of the improved 
procedures of the invention have allowed for the rapid 
isolation of three positive clones from within a 
screening of 1,500,000 phage plaques constituting a human 

25 genomic library,. This was accomplished through use of 
the above-noted mixture of 128 20-mer probes together 
with a second set of 128 17-mer probes based on amino 
acid analysis of a different continuous sequence of human 
-erythropoietin. _ 

30 The above-noted illustrative procedures consti- 

tute the first known instance of the use of multiple 
mixed oligonucleotide probes in DNA/DNA hybridization 
processes directed toward isolation of mammalian genomic 
clones and the first known instance of the use of a mix- 

35 ture of more than 32 oligonucleotide probes in the isola- 
tion of cDNA clones. 
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Numerous aspects and advantages of the invention 
will be apparent to those skilled in the art upon 
consideration of the following detailed description which 
provides illustrations of the practice of the invention 
5 in its presently preferred embodiments. 

DETAILED DESCRIPTION 

According to the present invention, DNA 

10 sequences encoding part or all of the polypeptide 
sequence of human and monkey species erythropoietin 
(hereafter, at times, ,, EP0 U ) have been isolated and 
characterized. Further, the monkey and human origin DNA 
has been made the subject of eucaryotic and procaryotic 

15 expression providing isolatable quantities of polypep- 
tides displaying biological (e.g., immunological) proper- 
ties of naturally-occurring EPO as well as both JLn vivo 
and in vitro biological activities of EPO. 

The DNA of monkey species origins was isolated 

20 from a cDNA library constructed with mRNA derived from 
kidney tissue of a monkey in a chemically induced anemic 
state and whose serum was immunologically determined to 
include high levels of EPO compared to normal monkey 
serum. The isolation of the desired cDNA clones con- 

25 taining EPO encoding DNA was accomplished through use of 
DNA/DNA colony hybridization employing a pool of 128 
mixed, radiolabeled , 20-mer oligonucleotide probes and - 
involved the rapid .sereening of 200,000 colonies. 'Design, 
of the oligonucleotide probes was based on amino acid 

30 sequence information provided by enzymatic fragmentation 
and sequencing a small sample of human EPO. 

The DNA of human species origins was isolated 
from a human genomic DNA library. The isolation of 
clones containing EPO-encoding DNA was accomplished 

35 through DNA/DNA plaque hybridization employing the above- 
noted pool of 128 mixed 20-mer oligonucleotide probes and 
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a second pool of 128 radiolabeled 17-mer probes whose 
sequences were based on amino acids sequence information 
obtained from a different enzymatic human EPO fragment. 
Positive colonies and plaques were verified by 
5 means of dideoxy sequencing of clonal DNA using a subset 
of 16 sequences within the pool of 20-mer probes and 
selected clones were subjected to nucleotide sequence 
analysis resulting in deduction of primary structural 
conformation of the EPO polypeptides encoded thereby. 
10 The deduced polypeptide sequences displayed a high degree 
of homology to each other and to a partial sequence 
generated by amino acid analysis of human EPO fragments. 

A selected positive monkey cONA clone and a 
selected positive human genomic clone were each inserted 
15 in a "shuttle" DNA vector which was amplified in E.coli 
and employed to transfect mammalian cells in culture. 
Cultured growth of transfected host cells resulted in 
culture medium supernatant preparations estimated to con- 
tain as much as 3000 mU of EPO per ml of culture fluid. 
20 The following examples are presented by way of 

illustration of the invention and are specifically 
directed to procedures carried out prior to iden- 
tification of EPO encoding monkey cONA clones and human 
genomic clones, to procedures resulting in such iden- 
25 tification, and to the sequencing, development of 

expression systems and immunological verification of EPO 
expression in such systems. 

. More particularly, Example 1 is* "directed to 
amino, acid sequencing of humair EPO fragments and con- 
30 struction of mixtures of radiolabeled probes based on 
the results of this sequencing. Example 2 is generally 
directed to procedures involved in the identification of 
positive monkey cONA clones and thus provides information 
concerning animal treatment and preliminary radioim- 
35 munoassay (RIA) analysis of animal sera. Example 3 is 
directed to the preparation of ■ the cDNA library, colony 
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10 



hybridization screening and verification of positive 
clones, DNA sequencing of a positive cDNA clone and the 
generation of monkey EPO polypeptide primary structural 
conformation (amino acid sequence) information. Example 
4 is directed to procedures involved in the iden- 
tification of positive human genomic clones and thus pro- 
vides information concerning the source of the genomic 
library, plaque hybridization procedures and verification 
of positive clones. Example 5 is directed to DNA 
sequencing of a positive genomic clone and the generation 
of human EPO polypeptide amino acid sequence information 
including a comparison thereof to the monkey EPO sequence 
information. Example 6 is directed to procedures for 
construction of a vector incorporating EPO-encoding DNA 
15 derived from a positive monkey cDNA clone, the use of the 
vector for transfection of COS-1 cells and cultured 
growth of the transfected cells. Example 7 is directed 
to procedures for construction of a vector incorporating 
EPO-encoding DNA derived from a positive human genomic 
20 clone, the use of the vector for transfection of COS-1 
cells and the cultured growth of the transfected cells. 
Example 8 is directed to immunoassay procedures performed 
on media supernatants obtained from the cultured growth 
of transfected cells according to Example 6 and 7. 
25 Example 9 is directed to in vitro and in vivo biological 
activity of microbially expressed EPO of Examples 6 and 
7. 

Example 10 is directed to a development of mam- 
malian host expression systems for monkey species EPO 

30 cDNA and human species genomic ONA involving Chinese 

hamster ovary ("CHO") cells and to the immunological and 
biological activities of products of these expression 
systems as well as characterization of such products. 
Example 11 is directed to the preparation of manufactured 

35 genes encoding human species EPO and EPO analogs, which 
genes include a number of preference codons for 
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expression in S.coli and yeast host cells, and to 
expression systems based thereon. Example 12 relates to 
the immunological and biological activity profiles of 
expression products of the systems of Example 11. 

5 

EXAMPLE 1 

A. Human EPQ Fragment Amino Acid Sequencing 

Human EPO was isolated from urine and subjected 

10 to tryptic digestion resulting in the development and 

isolation of 17 discrete fragments in quantities approxi- 
mating 100-150 picomoles. 

Fragments were arbitrarily assigned numbers and 
were analyzed for amino acid sequence by microsequence 

15 analysis using a gas phase sequencer (Applied Biosystems) 
to provide the sequence information set out in Table I, 
below, wherein single letter codes are employed and M X M 
designates a residue which was not unambiguously deter- 
mined. 



25 



30 
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TABLE I 



Fragment No. Sequence Analysi s Result 

5 TAa A-P-P-R 

T4b G-K-L-K 

T9 A-L-G-A-Q-K 

T13 V-L-E-R 

T16 • A-V-S-G-L-R 

10 T18 L-F-R 

T21 K-L-F-R 

T25 Y-L-L-E-A-K 

T26a l-I-C-D-S-R 

T2 6b L-Y-T-G-E-A-C-R 

15 T27 T-I-T-A-D-T-F-R 

T28 E-A-I-S-P-P-D-A-A-M-A-A-P-L-R 

T30 E-A-E-X-I-T-T-G-X-A-E-H-X-S-L 

N-E-X-I-T-V-P 

T31 y-Y-S-N-F-L-R 

20 T33 S-L-T-T-L-L-R 

T35 V-N-F-Y-A-W-K 

T38 G-Q-A-L-L-V-X-S-S-Q-P-W- 

E-P-L-Q-L-H-V-O-K 



25 



30 



35 
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B. Design and Construction of 

Oligonucleotide Probe Mixtures 

The amino acid sequences set out in Table I were 
reviewed in the context of the degeneracy of the genetic 
5 code for the purpose of ascertaining whether mixed probe 
procedures could be applied to DNA/DNA hybridization pro- 
cedures on cDNA and/or genomic DNA libraries. This ana- 
lysis revealed that within Fragment No. T35 there existed 
a series of 7 amino acid residues 

10 (Val-Asn-Phe-Tyr-Ala-Trp-Lys) which could be uniquely 
characterized as encoded for by one of 128 possible DNA 
sequences spanning 20 base pairs. A first set of 128 
20-mer oligonucleotides was therefore synthesized by 
standard phosphoamidite methods CSee, e.g., Beaucage, et 

15 al., Tetrahedron Letters , 22, pp. 1859-1862 (1981) on a 
solid support according to the sequence set out in Table 
II, below. 







TABLE II 




20 










Residue - Val - Asn 


Phe Tyr Ala 


Trjp. Lyj. 




3' CAA TTG 


AAG ATG CGA 


ACC TT - 5' 




T A 


A A T 






G 


G 




25 


C 


C 






Further analysis revealed that within fragment 




Noi T38 there existed a series of 6 amino acid residues 




(Gln-Pro-Trp-Glu-Pro 


-Leu) on the basis 


of which there 




could be prepared a 


pool of 128 mixed 


olignucieotide 


30 


17-mer probes as set 


out in Table III, 


below. 






TABLE III 






Residue - Gin Pro 


Trp Glu Pro 


Leu 


35 


3' GTT GGA 


ACC CTT GGA 


GA - 5' 




C T 


C T 


A 




G 


G 






C 


C 
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Oligonucleotide probes were labelled at the 5* 
end with gamma - 32 P-ATP, 7500-8000 Ci/mmole (ICN) using 
T 4 polynucleotide kinase (NEN). 

• 5 EXAMPLE 2 

A. Monkey Treatment Procedures and R IA Analysis 

Female Cynomolgus monkeys Mac a da fascicularias 
(2.5-3 kg, 1.5-2 years old) were treated subcutaneously 

10 with a pH 7.0 solution of phenylhydrazine hydrochloride 
at a dosage level of 12.5 mg/kg on days 1, 3 and 5. The 
hematocrit was monitored prior to each injection. On day 
7, or whenever the hematocrit level fell below 25% of the 
initial level, serum and kidneys, were harvested after 

15 administration of 25 mg/kg doses of ketamine hydroch- 
loride. Harvested materials were immediately frozen in 
liquid nitrogen and stored at -70*C. 



B. RIA for EPO 

20 Radioimmunoassay procedures applied for quan- 

titative detection of EPO in samples were conducted 
according to the following procedures: 

An erythropoietin standard or unknown sample was 
incubated together with antiserum for two hours at 37-C. 

25 After the two hour incubation, the sample tubes were 
cooled on ice, 125 I-labelled erythropoietin was added, 
and the tubes were incubated at 0*C for at least 15 more 
hours. Each assay tube contained 500 yl of incubation 
mixture consisting of 50 yl of diluted • immune sera, 

30 10,000 cpm of 125 I-erythropoietin , 5 yl trasylol and 

0-250 yl of either EPO standard or unknown sample, with 
P8S containing 0.1% BSA making up the remaining volume. 
The antiserum used was the second test bleed of a rabbit 



35 
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immunized with a 1% pure preparation of human urinary 

erythropoietin. The final antiserum dilution on the 

125 

assay was adjusted so that the antibody-bound I-EPO 
did not exceed 10-20% of the input total counts. In 
5 general, this corresponded to a final antiserum dilution 
of from 1:50,000 to 1:100,000. 

The antibody-bound 1 ? 5 I-erythropoietin was pre- 
cipitated by the addition of 150 ul Staph A. After a 40 
min. incubation, the samples were centrifuged and the 

10 pellets were washed two times with 0.75 ml 10 mM Tris-HCl 
pH 8.2 containing 0.15M NaCl, 2mM EDTA, and 0.05% Triton 
X-100. The washed pellets were counted in a gamma 
counter to determine the percent of 125 I-er ythropoietin 
bound. Counts bound by pre-immune sera were subtracted 

15 from all final values to correct for nonspecific precipi- 
tation. The erythropoietin content of the unknown 
samples was determined by comparison to the standard 
curve. 

The above procedure was applied to monkey serum 
20 obtained in Part A, above, as well as to the untreated 
monkey serum. Normal serum levels were assayed to con- 
tain approximately 36 mU/ml while treated monkey serum 
contained from 1000 to 1700 mU/ml. 

25 EXAMPLE 3 

A. Monkey cDNA Library Construction 

Messenger RNA was isolated from, normal and ane- 
mic-monkey kidneys by the guanidinium thiocyanate proce- 

30 dure of Chirgwin, et al . , Biochemistry , 18, p. 5294 
(1979) and poly (A) + mRNA was purified by two runs of 
oligoCdT ^-cellulose column chromatography as described at 
pp. 197-198 in Maniatis, et al., "Molecular Cloning, A 
Laboratory Manual" (Cold Springs Harbor Laboratory, Cold 

35 Springs, Harbor, N.Y., 1982). The cDNA library was con- 
structed according to a modification of the general pro- 
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cedures of Okayama, et al., Mol. and Cell. Biol., 2, 
pp. 161-170 (1982). The key features of the presently 
preferred procedures were as follows: (1) pUC8 was used 
as the sole vector, cut with Pst I and then tailed with 
5 oligo dT of 60-80 bases in length; (2) Hind i digestion 
was used to remove the oligo dT tail from one end of the 
vector; (3) first strand synthesis and oligo dG. tailing 
was carried out according to the published procedure? (4) 
BamHI digestion was employed to remove the oligo dG tail 
10 from one end of the vector; and (5) replacement of the 
RNA strand by DNA was in the presence of two linkers 
(GATCTAAAGACCGTCCCCCCCCC and ACGGTCTTTA) in a three-fold 
molar excess over the oligo dG tailed vector. 

15 B. Colony Hybridization Procedures For 

Screening Monkey cDNA Library 

Transformed E.coli were spread out at a density 
of 9000 colonies per 10 x 10 cm plate on nutrient plates 
containing 50 micrograms/ml Ampicillin. GeneScreen 

20 filters (New England Nuclear Catalog No. NEF-972) were 

pre-wet on a BHI-CAM plate (Bacto brain heart infusion 37 
g/L, Casaniino acids 2 g/L and agar 15 g/L, containing 500 
micrograms/ml Chloramphenicol) and were used to lift the 
colonies off the plate. The colonies were grown in the 

25 same medium for 12 hours or longer to amplify the plasmid 
copy numbers. The amplified colonies (colony side up) 
were treated by serially placing the filters over 2 
pieces of Whatman 3 MM paper saturated with each of the 
following solutions: 

30 (l) 50 mM glucose - 25 mM Tris-HCl (pH 8.0) - 

10 mM EDTA (pH 8.0) for five minutes; 

(2) 0.5 M NaOH for ten minutes; and 

(3) 1.0 M Tris-HCl (pH 7.5) for three minutes. 
The filters were then air dried in a vacuum over 

35 at 80*C for two hours. 

The filters were then subjected to Proteinase K 
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digestion through treatment with a solution containing 50 
micrograms/ml of the protease enzyme in Buffer K Co.lM 
Tris-HCl CpH 8.0] - 0.15M NaCl - 10 mM EDTA (pH 8.2) 
-0.2* SDS] . Specifically, 5 ml of the solution was added 
5 to each filter and the digestion was allowed to proceed 
at 55*C for 30 minutes, after which the solution was 
removed . 

The filters were then treated with 4 ml of a 
prehybridization buffer (5 x SSPE - 0.5* SDS - 100 
10 micrograms/ml SS E.coli DNA - 5 x BFP). The prehybridi- 
zation treatment was carried out at 55*C, generally for 4 
hours or longer, after which the prehybridization buffer 
was removed. 

The hybridization process was carried out in the 
15 following manner. To each filter was added 3 ml of 
hybridization buffer (5 x SSP.E - 0.5* SDS - 100 
micrograms/ml yeast tRNA) containing 0.025 picomoles of 
each of the 128 probe sequences of Table II (the total 
mixture being designated the EPV mixture) and the filters 
20 were maintained at 48 # C for 20 hours. This temperature 
was 2*C less than the lowest of the calculated disso- 
ciation temperatures CTd) determined for any of the pro- 
bes . 

Following hybridization, the filters were washed 
25 three times for ten minutes on a shaker with 6 x SSC 
-0.1* SDS at room temperature and washed two to three 
times with 6 x SSC - 1* SDS at the- hybridization tem- 
perature (48*C). 

* Autoradiography of the filters revealed seven- 
30 positive clones among the 200,000 colonies screened. 

Initial sequence analysis of one of the putative 
monkey cDNA clones (designated clone 83) was performed 
for verification purposes by a modification of the proce- 
dure of Wallace, et al., Gene, 16., pp. 21-26 (1981). 
35 Briefly, plasmid ONA from monkey cDNA clone 83 was 
linearized by digestion with EcoR I and denatured by 
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heating in a boiling water bath. The nucleotide sequence 
was determined by the dideoxy method of Sanger, et al., 
P.N.A.S. (U.S.AJ , 74, pp. 5463-5467 (1977). A subset of 
the EPV mixture of probes consisting of 16 sequences was 
5 used as a primer for the sequencing reactions. 

C. Monkey EPO cDNA Sequencing 

Nucleotide sequence analysis of clone 83 was 
carried out by the procedures of Messing, Methods in 

10 Enzymology, 101 , pp. 20-78 C1983). Set out in Table IV 
is a preliminary restriction map analysis of the approxi- 
mately 1600 base- pair EcoR I /Hind lll cloned fragment of 
clone 83. Approximate locations of restriction endo- 
nuclease enzyme recognition sites are provided in terms 

15 of number of bases 3* to the EcoR I site at the 5* end of 
the fragment. Nucleotide sequencing was carried out by 
sequencing individual restriction fragments with the 
intent of matching overlapping fragments. For example, 
an overlap of sequence information provided by analysis 

20 of nucleotides in a restriction fragment designated C113 
( Sau 3A at ~111 /Sma l at -324) and the reverse order 
sequencing of a fragment designated C73 ( Alu l at 
~424/ BstE II at -203). 
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TABLE IV 

Restriction Enzyme 
Recognition Site Approximate Location(s) 



5 


EcoRI 


1 




Sau3A 


111 




Smal 


180 




BstEII 


203 




Smal 


324 


10 


Kpnl 


371 




Rsal 


372 




. Alul 


424 




PstI 


426 




Alul 


430 


15 


Hpal 


466 




Alul 


546 




PstI 


601 




PvuII 


604 




Alul 


605 


20 


Alul 


782 




Alul 


788 




Rsal 


792 




PstI 


807 




Alul 


841 


25 


Alul 


927 




Ncol 


946 




Sau3A 


1014 




Alul 


1072 




Alul 


1115 


30 


Alul 


1223 




PstI 


1301 




Rsal 


1343 




Alul 


1384 




Hindlll 


1449 


35 


Alul 


1450 




Hindlll ■ 


1585 
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Sequencing of approximately 1342 base pairs 
(within the region spanning the Sau 3A site 3* to the 
EcoRI site and the Hindlll site) and analysis of all 
possible reading frames has allowed for the development 
5 of DNA and amino acid sequence information set out in 
Table V. In the Table, the putative initial amino acid 
residue of the amino terminal of mature EPO (as verified 
by correlation to the previously mentioned sequence ana- 
lysis of twenty amino terminal residues) is designated by 

10 the numeral +1 . ' The presence of a. methionine-specif ying 
ATG codon (designated -27) "upstream" of the initial 
amino terminal alanine residue as the first residue 
designated for the amino acid sequence of the mature pro- 
tein is indicative of the likelihood that EPO is ini- 

15 tially expressed in the cytoplasm in a precursor form 
including a 27 amino acid "leader" region which is 
excised prior to entry of mature EPO into circulation. 
Potential glycosylat ion sites within the polypeptide are 
designated by asterisks. The estimated molecular weight 

20 of the translated region was determine to be 21,117 

daltons and the M.W. of the 165 residues of the polypep- 
tide constituting mature monkey EPO was determined to be 
18,236 daltons. 

25 
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The polypeptide sequence of Table V may readily 
be subjected to analysis for the presence- of highly 
hydrophilic regions and/or secondary conformational 
characteristics indicative of potentially highly immuno- 
5 genie regions by, e.g., the methods of Hopp, et al . , 

P.N. A. 5. (U.S.A. ) , 78 , pp. 3824-3828 (1981) and Kyte et 
al., J.Mol.Biol. , 157 , pp. 105-132 (1982) and/or Chou, et 
al., Biochem. , 13 , pp. 222-245 (1974) and Advances in 
Enzymology , 47, pp. 45-47* (1978). Computer-assisted ana- 
10 lysis according to the Hopp, et al. method is available 
by means of a program designated PEP Reference Section 
6.7 made available by Intelligenet ics , Inc., 124 
University Avenue, Palo Alto, California. 

15 EXAMPLE 4 

A. Human Genomic Library 

A Ch4A phage-borne human fetal liver genomic 
library prepared according to the procedures of Lawn, et 
20 al., Cell , 18 , pp. 533-543 (1979) was obtained and main- 
tained for use in a plaque hybridization assay. 

B. Plaque Hybridization Procedures For 
Screening Human Genomic Library 

25 Phage particles were lysed and the DNAs were 

fixed on filters (50,000 plaques per filter) according to 
the procedures of Woo, Methods In Enzymoloqy , 68 , pp. 
389-395 (1979) except for the use of GeneScreen Plus • 
filters' (New England Nu clear Catalog No. NEF-976) and 

30 NZYAM plates (NaCl, 5g ; MgCl 2 -6H 2 0, 2 g; NZ-Amine A, 10gj 
yeast extract, 5g; casamino acids, 2 g; maltose; 2g; and 
agar, 15g per liter). 

The air-dried filters were baked at 80'C for 1 
hour and then digested with Proteinase K as described in 

35 Example 3, Part B. Prehybridization was carried out with 
a 1M NaCl - 1% SDS buffer for 55*C for 4 hours or more, 
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after which the buffer was removed. Hybridization and 
post-hybridization washings were carried out as described 
in Example 3, Part B. Both the mixture of 128 20-mer 
probes designated EPV and the mixture of 128 17-mer pro- 
5 bes of Table III (designated the EPQ mixture) were 

employed. Hybridization was carried out at 48*C using 
the EPV probe mixture. EPQ probe mixture hybridization 
was carried out at 46*C — 4 degrees below the lowest 
calculated Td for members of the mixture. Removal of the 

10 hybridized probe for rehybridization was accomplished by 
boiling with 1 x SSC - 0.1% SDS for two minutes. 
Autoradiography of the filters revealed three positive 
clones (reactive with both probe mixtures) among the 
1,500,000 phage plaques screened. Verification of the 

15 positive clones as being EPO-encoding was obtained 

through DNA sequencing and electron micrographic visuali- 
zation of heteroduplex formation with the monkey cDNA of 
Example 3. This procedure also gave evidence of multiple 
introns in the genomic DNA sequence . 

20 

EXAMPLE 5 

Nucleotide sequence analysis of one of the posi- 
tive clones (designated XhEl) was carried out and results 
25 obtained to date are set out in Table VI. 



30 



35 
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In Table VI, the initial continuous DMA 
sequence designates a top strand of 620 bases in what is 
apparently an untranslated sequence immediately preceding 
a translated portion of the human EPO gene. More speci- 
5 fically, the sequence appears to comprise the 5* end of 
the gene which leads up to a translated DNA region coding 
for the first four amino acids (-27 through -24) of a 
leader sequence ('^resequence*' ) . Four base pairs in the 
sequence prior to that encoding the beginning of the 

10 leader have not yet been unambiguously determined and are 
therefore designated by an "X". There then follows an 
intron of about 639 base pairs (439 base pairs of which 
have been sequenced and the remaining 200 base pairs of 
which are designated M I.S. ,a ) and immediately preceding a 

15 codon for glutamlne which has been designated as residue 
-23 of the translated polypeptide. The exon sequence 
immediately following is seen to code for amino acid 
residues through an alanine residue (designated as the +1 
residue of the amino acid sequence of mature human EPO) 

20 to the codon specifying threonine at position +26, 

whereupon there follows a second intron consisting of 256 
bases as specifically designated. Following this intron 
is an exon sequence for amino acid residues 27 through 55 
and thereafter a third intron comprising 612 base pairs 

25 commences. The subsequent exon codes for residues 56 
through 115 of. human EPO and there then commences a 
fourth intron of 134 bases as specified. Following the 
fourth intron is an exon coding for residue Nos. 116 
through 166 and a "stop" codon (TGA ) . Finally, Table VI 

30 identifies a sequence of 568 base pairs in what appears 
to be an untranslated 3' region of the human EPO gene, 
two base pairs of which ("X" ) have not yet been unam- 
biguously sequenced . 

Table VI thus serves to identify the primary 

35 structural conformation (amino acid sequence) of mature 
human EPO as including 166 specified amino acid residues 
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(estimated M.W. = 18,399). Also revealed in the Table is 
the DNA sequence coding for a 27 residue leader sequence 
along with 5 f and 3* DNA sequences which may be signifi- 
cant to promoter/operator functions of the human gene 
5 operon. Sites for potential glycosylation of the mature 
human EPO polypeptide are designated in the Table by 
asterisks. It is worthy of note that the specific amino 
acid sequence of -Table VI likely constitutes that of a 
naturally occurring allelic form of human erythropoietin. 

10 Support for this position is found in the results of con- 
tinued efforts at sequencing of urinary isolates of human 
erythropoietin which provided the finding that'a signifi- 
cant number of erythropoietin molecules therin have a 
methionine at residue 126 as opposed to a serine as shown 

15 in the Table. 

Table VII, below, illustrates the extent of 
polypeptide sequence homology between human and monkey 
EPO. In the upper continuous line* of the Table, single 
letter designations are employed to represent the deduced 

20 translated polypeptide sequences of human EPO commencing 
with residue -27 and the lower continuous line shows the 
deduced polypeptide sequence of monkey EPO commencing at 
assigned residue number -27. Asterisks are employed to 
highlight the sequence homologies. It should be noted 

25 that the deduced human and monkey EPO sequences reveal an 
"additional" lysine (K) residue at (human) position 116. 
Cross-reference to Table VI indicates that this residue 
is at the margin of a putative mRNA splice junction in 
the genomic sequence. Presence of the lysine residue in 

30 the human polypeptide sequence was further verified by 
sequencing of a cDNA human sequence clone prepared from 
mRNA isolated from C0S-1 cells transformed with the human 
genomic DNA in Example 7, infra . 
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EXAMPLE 6 

The expression system selected for initial- 
attempts at microbial synthesis of isolatable quantities 
5 of EPO polypeptide material coded for by the monkey cDNA 

provided by the procedures of Example 3 was one involving 
mammalian host cells (i.e., C0S-1 cells, A.T.C.C. No. 
CRL-1650). The cells were transfected with a "shuttle" 
vector capable of autonomous replication in E .coli host 

10 (by virtue of the presence of pBR322-der ived DNA) and the 
mammalian hosts (by virtue of the presence of SV40 virus- 
derived DNA). 

More specifically, an expression vector was 
constructed according to the following procedures. The 

15 plasmid clone 83 provided in Example 3 was amplified in 
E .coli and the approximately 1.4kb monkey EPO-encoding 
DNA was isolated by EcoR I and Hindlll digestion. 
Separately, isolated was an approximately 4.0 kb , 
Hind lll /Sal l fragment from pBR322. An approximately 30 

20 bp, Eco RI /Sal l "linker" fragment was obtained from 
M13mpl0 RF DNA (P and L Laboratories). This linker 
included, in series, an Eco RI sticky end, followed by 
SstI, Smal, Bam HI and Xba l recognition sites and a Sai l 
sticky end. The above three fragments were ligated to 

25 provide an approximately 5.4 kb intermediate plasmid 

("pERS") wherein the EPO ONA was flanked on one side by a 
"bank" of useful restriction endonuclease recognition 
sites. pERS was then digested ' with Hind lll and Sai l to 
yield the EPO DNA and the EcoRI to Sai l (M13mpl0) linker. 

30 The 1.4 kb fragment was ligated with an approximately 4.0 
kb Bam HI /Sal l of pBR322 and another M13mpl0 Hind I I I/Bam HI 
RF fragment linker also having approximately 30 bp. The 
M13 linker fragment was characterized by a Hind lll sticky 
end, followed by Pst I , Sai l , Xba l recognition sites and a 

35 Bam HI sticky end. The ligation product was, again, a 

useful intermediate plasmid ( "pBR-EPO" ) including the EPO 
DNA flanked on both sides by banks of restriction site. 
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The vector chosen for expression of the EPO DNA 
in COS-1 cells ("pDSVLl") had previously been constructed 
to allow for selection and autonomous replication in 
E.coli . These characteristics are provided by the origin 
5 of replication and Ampicillin resistance gene DNA sequen- 
ces present in the region spanning nucleotides 2448 
through 4362 of pBR322. This sequence was structurally 
modified by the addition of a linker providing a Hindlll 
recognition immediately adjacent nucleotide 2448 prior to 

10 incorporation into the vector. Among the selected vec- 
tor's other useful properties was the capacity to autono- 
mously replicate in COS-1 cells and the presence of a 
viral promoter sequence functional in mammalian cells. 
These characteristics are provided by the origin of 

15 replication DNA sequence and "late gene" viral promoter 
DNA sequence present in the 342 bp sequence spanning 
nucleotide numbers 5171 through 270 of the SV40 genome. 
A unique restriction site (BamHI) was provided in the 
vector and immediately adjacent the viral promoter 

20 sequence through use of/ a commercially available linker 
sequence [Collaborative Research), Also incorporated in 
the vector was a 237 base pair sequence (derived as 
nucleotide numbers 2553 through 2770 of SV40) containing 
the "late gene" viral mRNA polyadenylat ion signal 

25 (commonly referred to as a transcription terminator). 

This fragment was positioned in the vector in the proper 
orientation vis-a-vis the "late gene" viral promoter via 
the unique BamH I site. Also present in the vector was 
another .mammalian gene at a location not material to 

30 potential transcription of a gene inserted at the unique 
BamH I site, between the viral promoter and terminator 
sequences . [The mammalian gene comprised an approxima- 
tely 2,500 bp mouse dihydrof olate reductase (DHFR) mini- 
gene isolated from plasmid pMG-1 as in Gasser, et al., 

35 P.N.A.S. (U.S.A. ) » 79, pp. 6522-6526, (1982).] Again, 
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the major operative components of plasmid pDSVLl comprise 
•nucleotides 2448 through 4362 of pBR322 along with 
nucleotides 5171 through 270 (342bp) and 2553 through 
2770 (237bp) of SV40 DNA • 
5 Following procedures described, e.g., in 

Maniatis, et al., supra , the EPO-encoding DNA was iso- 
lated from plasmid pBR-EPO as a Bam HI fragment and 
ligated into plasmid pDSVLl cut with Bam HI. Restriction 
enzyme analysis was employed to confirm insertion of the 

10 EPO gene in the correct orientation in two of the 

resulting cloned vectors (duplicate vectors H and L). 
See Figure 2, illustrating plasmid pDSVL-MkE. Vectors 
with EPO genes in the wrong orientation were saved for 
use as negative controls in transfection experiments 

15 designed to determine EPO expression levels in hosts 
transformed with vectors having EPO DNA in the correct 
orientation . 

Vectors H, L, F, X and G were combined with 
carrier DNA (mouse liver and spleen DNA) were employed to 

20 transfect duplicate 60mm plates by calcium phosphate 
microprecipitate methods. Duplicate 60 mm plates were 
also transfected with carrier DNA as a "mock" transfor- 
mation negative control. After five days all culture 
media were tested for the presence of polypeptides 

25 possessing the immunological properties of naturally- 
occurring EPO. 

.• EXAMPLE 7 

30 A. Initial EPO Expression System 

Involving COS-1 Cells 

The system selected for initial attempts at 
microbial synthesis of isolatable quantities of human EPO 
polypeptide material coded for by the human genomic ONA 
35 EPO clone, also involved expression in mammalian host 
cells (i.e., COS-1 cells, A.T.C.C. No. CRL-1650). The 
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human EPO gene was first sub-cloned into a "shuttle" vec- 
tor which is capable of autonomous replication in both 
E.coli hosts (by virtue of the presence of pBR322 derived 
DNA) and in the mammalian cell line COS-1 (by virtue of 
5 the presence of SV40 virus derived DNA). The shuttle 
vector, containing the EPO gene, was then transfected 
into COS-1 cells, EPO polypeptide material was produced 
in the transfected cells and secreted into the cell 
culture media. 

10 More specifically, an expression vector was 

constructed according to the following procedures. DNA 
isolated from lambda clone XhEl, containing the human 
genomic EPO gene, was digested with BamHI and Hindlll 
restriction endonucleases , and a 5.6 Kb DNA fragment 

15 known to contain the entire EPO gene was isolated. This 
fragment was mixed and ligated with the bacterial plasmid 
pUC8 (Bethesda Research Laboratories, Inc.) which had 
been similarly digested, creating the intermediate 
plasmid "pUCS-HuE" , providing a convenient source of this 

20 restriction fragment. 

The vector chosen for expression of the EPO DNA 
in COS-1 cells (pSVASEt) had previously been constructed. 
Plasmid pSV4SEt contained DNA sequences allowing selec- 
tion and autonomous replication in E . coli . These charac- 

25 teristics are provided by the origin of replication and 
Ampicillin resistance gene DNA sequences present in the 
region spanning nucleotides 2448 through 4362 of the bac- 
- terial plasmid pBR322. This sequence was structurally 
modified by the addition of a linker providing a Hindlll 

30 recognition site immediately adjacent to nucleotide 2448. 
Plasmid pSV4SEt was also capable of autonomous replica- 
tion in COS-1 cells. This characteristic was provided by 
a 342 bp fragment containing the SV40 virus origin of 
replication (nucleotide numbers 5171 through 270). This 

35 fragment had been modified by the addition of a linker 
providing an EcoR l recognition site adjacent to 
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nucleotide 270 and a linker providing a Sai l recognition 
site adjacent nucleotide 5171. A 1061 bp fragment of 
SV40 was also present in this vector (nucleotide numbers 
1711 through 2772 plus a linker providing a Sai l recogni- 
5 tion site next to nucleotide number 2772). Within this 
fragment was an unique BamHI recognition sequence. In 
summary, plasmid pSV4SEt contained unique Bam HI and 
Hind lll recognition sites, allowing insertion of the 
human EPO gene, sequences allowing replication and selec- 

10 tion in E.coli , and sequences allowing replication in 
C0S-1 cells. 

In order to insert the EPO gene into pSV4SEt, 
plasmid pUC8-HuE was digested with Bam HI and Hindlll 
restriction endonucleases and the 5.6 kb EPO encoding DNA 

15 fragment isolated. pSV4SEt was also digested with Bam HI 
and Hind lll and the major 2513 bp fragment isolated 
(preserving all necessary functions). These fragments 
were mixed and ligated, creating the final vector 
"pSVgHuEPO" . (See, Figure 3.) This vector was propa- 

20 gated in E.coli and vector DNA isolated. Restriction 

enzyme analysis was employed to confirm insertion of the 
EPO gene. 

Plasmid pSVgHuEPO DNA was used to express human 
EPO polypeptide material in C0S-1 cells. More specifi- 

25 cally, pSVgHuEPO DNA was combined with carrier DNA and 
transfected into triplicate 60 mm plates of C0S-1 cells. 
As a control, carrier DNA alone was also- transfected into 
COS-1 cells. Cell culture media. were sampled five and 
seven days later and tested .for the presence of polypep- 

30 tides possessing the immunological properties of 
naturally occurring human EPO . 

B. Second EPO Expression System 

Involving COS-1 Cells 

35 Still another system was designed to provide 

■ improved production of human EPO polypeptide material 
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coded by the human genomic DNA EPO clone in COS-1 cells 
(A.T.C.C. No. CRL-1650). 

In the immediately preceding system, EPO was 
expressed in COS-1 cells using its own promoter which is 
5 within the 5.6 Kb SamH I to Hindlll restriction fragment. 
In the following construction, the EPO gene is altered so 
that it is expressed using the SV40 late promoter. 

More specifically, the cloned 5.6 Kb BamH I to 
Hind lll genomic human EPO restriction fragment was 

10 modified by. the following procedures. Plasmid pUC8-HuE, 
as described above, was cleaved with Bam HI and with 
BstEII restriction endonucleases . BstEII cleaves within 
the 5.6 Kb EPO gene at a position which is 44 base pairs 
5' to the initiating ATG coding for the pre-peptide and 

15 approximately 680 base pairs 3* to the Hindlll restric- 
tion site. The approximately 4900 base pair fragment was 
isolated. A synthetic linker DNA fragment, containing 
Sai l and BstE II sticky ends and an internal Bam HI 
recognition site was synthesized and purified. The two 

20 fragments were mixed and ligated with plasmid pBR322 
which had been cut with Sai l and BamHl to produce the 
intermediate plasmid pBRgHE. The genomic human EPO gene 
can be isolated therefrom as a 4900 base pair BamH I 
digestion fragment carrying the complete structural gene 

25 with a single ATG 44 base pairs 3* to BamHI site adjacent 
the amino terminal coding region. 

This fragment was isolated and inserted as a 
BamH I fragment into BamH I cleaved expression vector 
plasmid pDSVLl (described in Example 6). The resulting 

30 plasmid, pSVLgHuEPO, as illustrated in Figure 4, was used 
to express EPO polypeptide material from COS-1 cells, as 
described in Examples 6 and 7A. 



35 



EXAMPLE 8 

.Culture media from growth of the six transfected 
COS-1 cultures of Example 6 were analyzed by radioim- 
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munoassay according to the procedures set forth in 
Example 2, Part B. Each sample was assayed at 250, 125, 
50, and 25 microliter aliquot levels. Supernatants from 
growth of cells mock transfected or transfected with vec- 
5 tors having incorrect EPO gene orientation were unam- 
biguously negative for EPO immunoreact ivit y . For each 
sample of the two supernatants derived from growth of 
COS-1 cells transfected with vectors (H and L) having the 
EPO DNA in the correct orientation, the % inhibition of 

10 125 I-EP0 binding to antibody ranged from 72 to 88*, which 
places all values at the top of the standard curve. The 
exact concentration of EPO in the culture supernatant 
could not then reliably be estimated. A quite conser- 
vative estimate of 300 mil/ml was made, however, from the 

15 value calculation of the largest aliquot size (250 
microliter ) . 

A representative culture fluid according to 
Example 6 and five and seven day culture fluids obtained 
according to Example 7A were tested in the RIA in order 

20 to compare activity of recombinant monkey and human EPO 
materials to a naturally-occurring human EPO standard and 
the results are set out in graphic form in Figure 1. 
Briefly, the results expectedly revealed that the recom- 
binant monkey EPO significantly competed for anti-human 

25 EPO antibody although it was not able to completely ^inhi- 
bit binding under the test conditions. The maximum per- 
cent inhibition values for recombinant human EPO, 
however, closely approximated those of the human EPO 
standard, The .parallel nature .of the dose response 

30 curves suggests immunological identity of the sequences 
(epitopes) in common. Prior estimates of monkey EPO in 
culture fluids were re-evaluated at these higher dilution 
levels and were found to range from 2.91 to 3.12 U/ml. 
Estimated human EPO production levels were correspon- 

35 dingly set at 392 mil/ml for the five-day growth sample 
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and 567 mU/ml for the seven day growth sample. Estimated 
monkey EPO production levels in the Example 7B expression 
system were on the same order or better. 

5 EXAMPLE 9 

Culture fluids prepared according to Examples 6 
and 7 were subjected to an in vitro assay for EPO acti- 
vity according to the procedure of Gqldwasser, et al., 

10 Endocrinology , 97, 2, pp. 315-323 (1975). Estimated 

monkey EPO values for culture fluids tested ranged from 
3.2 to 4.3 U/ml. Human* EPO culture fluids were also 
active in this in vitro assay and, further, this activity 
could be neutralized by anti-EPO antibody. The recom- 

15 binant monkey EPO culture fluids according to Example 6 
were also subjected to an assay for ijn vivo biological 
activity according to the general procedures of Cotes, et 
al., Nature , 191 , pp. 1065-1067 (1961) and Hammond, et 
al., Ann.N. Y.Acad. Sci. . 149, pp. 516-527 (1968) and acti- 

20 vity levels ranged from 0.94 to 1.24 U/ml. 

EXAMPLE 10 

In the previous examples, recombinant monkey or 
25 human EPO material was produced from vectors used to 

transfect COS-1 cells. These vectors replicate in COS-1 
cells due to the presence of SV40 T antigen within the 
cell and an SV40 origin of replication on the vectors. 
Though these vectors produce useful, quantities' of EP0:in_. 
30 COS-1 cells, expression is only transient (7 to 14 days) 
due to the eventual loss of the vector. Additionally, 
only a small percentage of COS-1 became productively 
transfected with the vectors. The present example 
describes expression systems employing Chinese hamster 
35 ovary (CHO) DHFR~ cells and the selectable marker, DHFR . 
[For discussion of related expression systems, see 
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U.S. Letters Patent No. 4,399,216 and European Patent 
Applications 117058, 117059 and 117060, 'all published 
August 29, 1984.] 

CHO DHFR" cells (DuX-Bll) CHO Kl cells, Urlaub, 
5 et al., Proc. Nat. Acad. Sci. (U.S.A.) , Vol. 77, 4461 

(1980) lack the enzyme dihydrof olate reductase (DHFR) due 
to mutations in the structural genes and therefore 
require the presence of glycine, hypoxanthine , and thymi- 
dine in the culture media. Plasmids pDSVL-MkE (Example 

10 6) or pDSVL-gHuEPO (Example 7B) were transfected along 
with carrier DNA into CHO DHFR - cells growing in media 
containing hypoxanthine, thymidine, and glycine in 60 mm 
culture plates. Plasmid pSVgHuEPO (Example 7A) was mixed 
with the plasmid pMG2 containing a mouse dihydrof olate 

15 reductase gene cloned into the bacterial plasmid vector 
pBR322 (per Gasser, et al., supra . ) The plasmid mixture 
and carrier DNA was transfected into CHO DHFR" cells. 
(Cells which acquire one plasmid will generally also 
acquire a second plasmid). After three days, the cells 

20 were dispersed by tr ypsinization into several 100 mm 

culture plates in media lacking hypoxanthine and thymi- 
dine. Only those cells which have been stably trans- 
' formed with the DHFR gene, and thereby the EPO gene, 
survive in this media. After 7-21 days, colonies of sur- 

25 viving cells became apparent. These transformant colo- 
nies, after dispersion by trypsinization can be 
continuously propagated in media lacking hypoxanthine and 
thymidine, creating new cell- strains (e.g., CHQ 
_ pDSVL-MkEPO, CHO pSV.gHuEPO, CHO-pDSVL-gHuE.PO ) 

30 Culture fluids from the above cell strains were 

tested in the RIA for the presence of recombinant monkey 
or human EPO. Media for strain CHO pDSVL-MkEPO contained 
EPO with immunological properties like that obtained from 
COS-1 cells transfected with plasmid pDSVL-MkEPO. A 

35 representative 65 hour cul.ture fluid contained monkey EPO 
at 0.60 U/ml. 
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Culture fluids from CHO pSVgHuEPO and CHO 
pDSVL-gHuEPO contained recombinant human EPO with immuno 
logical properties like that obtained with COS-1 cells 
transfected with plasmid pSVgHuEPO or pDSVL-gHuEPO. A 
5 representative 3 day culture fluid from CHO pSVgHuEPO 
contained 2.99 U/ml of human EPO and a 5.5 day sample 
from CHO pDSVL-gHuEPO had 18.2 U/ml of human EPO as 
measured by the RIA. 

The quantity of EPO produced by the cell strain 

10 described above can be increased by gene amplification 
giving new cell strains of greater productivity. The 
enzyme dihydrof olate reductase (DHFR ) which is the pro- 
duct coded for by the DHFR gene can be inhibited by the 
drug methotrexate (MTX) • More specifically, cells propa 

15 gated in media lacking hypoxanthine and thymidine are 
inhibited or killed by MTX . Under the appropriate con- 
ditions, (e.g., minimal concentrations of MTX) cells 
resistant to and able to grow in MTX can be obtained. 
These cells are found to be resistent to MTX due to an 

20 amplification of the number of their DHFR genes, result- 
ing in increased production of DHFR enzyme. The sur- 
viving cells can, in turn, be treated with increasing 
concentrations of MTX, resulting in cell strains con- 
taining greater numbers of DHFR genes. "Passenger genes 

25 (e.g., EPO) carried on the expression vector along with 
the DHFR gene or transformed with the DHFR gene are fre- 
quently found also to be increased in their gene copy 
number . ■ . - 

As examples of practice of- this amplification 

30 system, cell strain CHO pDSVL-MkE was subjected to 

increasing MTX concentrations (0 nM, 30 nM and 100 nM). 
Representative 65-hour culture media samples from each 
amplification step were assayed by RIA and determined to 
contain 0.60, 2.45 and 6.10 U/ml, respectively. Cell 

35 strain CHO pDSVL-gHuEPO was subjected to a series of 
increasing MTX concentrations of 30 nM, 50 nM, 100 nM, 
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200 nM, 1 uM, and 5 uM MTX . A representative 3-day 
culture media sample from the 100 nM MTX step contained 
human EPO at 3089 ± 129 u/ml as judged by RIA. 
Representative 48 hour cultural medium samples from the 
5 100 nM and 1 uM MTX steps contained, respectively, human 
EPO at 466 and 1352 U/ml as judged by RIA (average of 
triplicate assays). In these procedures, 1 x 10 6 cells 
were- plated in 5 ml of media in 60 mm culture dishes* 
Twenty-four hours later the media were removed and 

10 replaced with 5 ml of serum-free media (high glucose OMEM 
supplemented with 0.1 mM non-essential amino acids and 
L-glutamine) . EPO was allowed to accumulate for 48 hours 
in the serum-free media. The media was collected for RIA 
assay and the cells were trypsinized and counted. The 

15 average RIA values of 467 U/ml and 1352 U/ml for cells 
grown at 100 nM and 1 yM MTX, respectively, provided 
actual yields of 2335 U/plate and 6750 U/plate. The 
average cell numbers per plate were 1.94 x 10 6 and 
3.12 x 10 6 cells, respectively. The effective production 

20 rates for these culture conditions were thus 1264 and 
2167 U/10 6 cells/48 hours. 

The cells in the cultures described immediately 
above are a* genetically heterogeneous population. 
Standard screening procedures are being employed in an 

25 attempt to isolate genetically hemogeneous clones with 

the highest production capacity. See, Section A, Part 2, 
of "Points to Consider in the Characterization of Cell 
Lines Used .to Produce Biologies" , June 1, 1984, Office of 
Biologies Research Review, Center for Drugs and 

30 Biologies, U.S. Food and Drug Administration. 

The productivity of the EPO producing CHO cell 
lines described above can be improved by appropriate cell 
culture techniques. The propagation of mammalian cells 
in culture generally requires the presence of serum in 

35 the growth media. A method for production of erythro- 
poietin from CHO cells in media that does not contain 
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serum greatly facilitates the purification of erythro- 
poietin from the culture medium. The method described 
below is capable of economically producing erythropoietin 
in serum-free media in large quantities sufficient for 
5 production. 

Strain CHO pOSVL-gHuEPO cells, grown in standard 
cell culture conditions, are used to seed spinner cell 
culture flasks. The cells are propagated as a suspension 
cell line in the spinner cell culture flask in media con- 

10 sisting of a 50-50 mixture of high glucose DMEM and Ham's 
F12 supplemented with 5% fetal calf serum, L-gluta- 
mine, Penicillin and Streptomycin, 0.05 mM non-essential 
amino acids and the appropriate concentration of metho- 
trexate. Suspension cell culture allows the EPO-produc- 

15 ing CHO cells to be expanded easily to large volumes. 
CHO cells, grown in suspension, are used to seed roller 
bottles at an initial seeding density of 1.5 x 10 7 viable 
cells per 850 cm 2 roller bottle in 200 ml of media. The 
cells are allowed to grow to confluency as an adherent 

20 cell line over a three-day period. The media used for 
this phase of the growth is the same as used for growth 
in suspension. At the end of the three-day growth 
period, the serum containing media is removed and 
replaced with 100 ml of serum-free media; 50-50 mixture 

25 of high glucose DMEM and Ham's F12 supplemented with 0.05 
mM non-essential amino acids and L-glutamihe. The 
roller bottles are returned to the roller bottle incuba- 
tor for a period of 1-3 hours and the media again is 
removed and replaced with 100 ml of fresh serum-free 

30 media. The 1-3 hour incubation of the serum-free media 
reduces the concentration of contaminating serum pro- 
teins. The roller bottles are returned to the incubator 
for seven days during which erythropoietin accumulates in 
the serum-free culture media. At the end of the seven- 

35 day production phase, the conditioned media is removed 
and replaced with fresh serum-free medium- for a second 
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production cycle. As an example of the practice of this 
production system, a representative seven-day, serum-free 
media sample contained human erythropoietin at 3892+409 
U/ml as judged by the RIA. Based on an estimated cell 
5 density of 0.9 to 1.8 x 10 5 cells/cm 2 , each 850 

cm 2 roller bottle contained from 0.75 to 1.5 x 10 8 cells 
and thus the rate of production of EPO in the 7-day, 100 
ml culture was 750 to 1470 U/10 6 cells/48 hours. 

Culture fluids from cell strain CHO pDSVL-MkEPQ 

10 carried in 10 nM MTX were subjected to RIA ia vitro and 
in vivo EPO activity assays. The conditioned media 
sample contained 41.2 ± 1.4 U/ml of MkEPO as measured by 
the RIA, 41.2 ± 0.064 U/ml as measured by the iji vitro 
biological activity assay and 42.5 ± 5 U/ml as measured 

15 by the in vivo biological activity assay. Amino acid 

sequencing of polypeptide products revealed the presence 
of EPO products, a principle species having 3 residues of 
the "leader" sequence adjacent the putative amino ter- 
minal alanine. Whether this is the result of incorrect 

20 membrane processing of the polypeptide in CHO cells or 
reflects a difference in structure of the amino terminus 
of monkey EPO vis-a-vis human EPO, is presently unknown. 

Culture fluids from cell strain CHO pDSVL-gHuEPO 
were subjected to the three assays. A 5.5 day sample 

25 contained recombinant human EPO in the media at a level 
of 18.2 U/ml by RIA assay, 15.8 ± 4.6 U/ml by in vitro 
assay and 16.8 ± 3.0 U/ml by ijn vivo assay. 

Culture fluid from CHO pDSVL-gHuEPO cells pre- 
pared amplified by stepwise 100 nM MTX were subjected to. 

30 the three assays. A 3.0 day sample contained recombinant 
human EPO at a level of 3089 ± 129 U/ml by RIA, 2589 ± 
71.5 U/ml by ir± vitro assay, and 2040 ± 160 U/ml by in 
vivo assay. Amino acid sequencing of this product 
reveals an amino terminal corresponding to that 

35 designated in Table VI. 

Cell conditioned media 'from CHO. cells trans- 
fected with plasmid pDSVL-MkE in 10 nM MTX were pooled, 
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and the MTX dialyzed out over several days, resulting In 
media with an EPO activity of 221 ± 5,1 U/ml (EPO-CCM) • 
To determine the in vivo effect of the EPO-CCM upon hema- 
tocrit levels in normal Baib/C mice, the following 
5 experiment was conducted. Cell conditioned media from 
untransfected CHO cells (CCM} and EPO-CCM were adjusted 
with PBS. CCM was used for the control group (3 mice) 
and two dose levels of EPO-CCM — 4 units per injection 
and 44 units per injection — were employed for the 

10 experimental groups (2 mice/group). Over the course of 5 
weeks, the seven mice were injected intraper itoneally , 3 
times per week. After the eighth injection, average 
hematocrit values for the control group were determined 
to be 50. 4X5 for the 4U group, 55.1%? and, for the 44U 

15 group, 67.9%. 

Mammalian cell expression products may be 
readily recovered in substantially purified form from 
culture media using HPLC (C 4 ) employing an ethanol gra- 
dient, preferably at pH7. 

20 A preliminary attempt was made to characterize 

recombinant glycoprotein products from conditioned medium 
C0S-1 and CHO cell expression of the human EPO gene in 
comparison to human urinary EPO isolates using both 
Western blot analysis and SDS-PAGE. These studies indi- 

25 cated that the CHO-produced EPO material had a somewhat 
higher molecular weight than the C0S-1 expression product 
which, in turn, was slightly larger than the pooled 
source human urinary extract. *~ All products were somewha-t 
heterogeneous. Neuraminidase enzyme treatment, to remove 

30 sialic acid resulted in C0S-1 and CHO recombinent pro- 
ducts of approximately equal molecular weight which were 
both nonetheless larger than the resulting asialo human 
urinary extract. Endoglycosidase F enzyme (EC 3.2.1) 
treatment of the recombinant CHO product and the urinary 

35 extract product (to totally remove carbohydrate from 
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both) resulted in substantially homogeneous products 
having essentially identical molecular weight charac- 
teristics • 

Purified human urinary EPO and a recombinant, 
5 CHO cell-produced, EPO according to the invention were 
subjected to carbohydrate analysis according to the pro- 
cedure of Ledeen, et al. Methods in Enzymology , 
83(Part 0) , 139-191 (1982) as modified through use of the 
hydrolysis procedures of Nesser, et al., Anal .Biochem. , 

10 142 , 58-67 (1984). Experimentally determined car- 
bohydrate constitution values (expressed as molar ratios 
of carbohydrate in the product) for the urinary isolate 
were as follows: Hexoses, 1.73; N-acetylglucosamine , 1; 
N-acetylneuraminic acid, 0.93; Fucose, 0; and N-acetyl- 

15 galactosamine , 0. Corresponding values for the recom- 
binant product (derived from CHO pDSVL-gHuEPO 3-day 
culture media at 100 nM MTX) were as follows: Hexoses, 
15.09; N-acetylglucosamine, 1; N-acetylneuraminic acid, 
0.998; Fucose, 0; and N-ace tylgalactosamine , 0. These 

20 findings are consistent with the Western blot and 
SDS-PAGE analysis described above. 

Glycoprotein products provided by the present 
invention are thus comprehensive of products having a 
primary structural conformation sufficiently duplicative 

25 of that of a naturally-occurring erythropoietin to allow 
possession of one or more of the biological properties 
thereof and having an average carbohydrate composition 
which differs from that of naturally-occurring erythro- 
poietin . ; . 

30 EXAMPLE 11 

The present example relates to the total manu- 
facture by assembly of nucleotide bases of two structural 
genes encoding the human species EPO sequence of Table VI 
35 and incorporating, respectively "preferred 11 codons for 
expression in E.coli and yeast C S .cerevisiae ) cells. 
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Also described is the construction of genes encoding ana- 
logs of human EPO. Briefly stated, the protocol employed 
was generally as set out in the previously noted disclo- 
sure of Alton, et al. (WO 83/04053). The genes were 
5 designed for- initial assembly of component oligonucleoti- 
des into multiple duplexes which, in turn, were assembled 
into three discrete sections. These sections were 
designed for ready amplification and, upon removal from; 
the amplification system, could be assembled sequentially 
10 or through a multiple fragment ligation in a suitable 
expression vector. 

Tables VIII through XIV below illustrate the 
design and assembly of a manufactured gene encoding a 
human EPO translation product lacking any leader or pre- 
15 sequence but including an initial methionine residue at 
position -1. Moreoever, the gene incorporated in 
substantial part E.coli preference codons and the 
construction was therefore referred to as the "ECEPO" 
gene. 



20 



25 



30 
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TABLE VIII 
ECEPO SECTION 1 OLIGONUCLEOTIDES 

1 . AATTCTAGAAACCATGAGGGTAATAAAATA 

2 . CCATTATTTTATTACCCTCATGGTTTCTAG 
5 3. ATGGCTCCGCCGCGTCTGATCTGCGAC 

4. CTCGAGTCGCAGATCAGACGCGGCGGAG 

5. TCGAGAGTTCTGGAACGTTACCTGCTG 

6. CTTCCAGCAGGTAACGTTCCAGAACT 

7. GAAGCTAAAGAAGCTGAAAACATC 
10 8. GTGGTGATGTTTTCAGCTTCTTTAG 

9. ACCACTGGTTGTGCTGAACACTGTTC 

10. CAAAGAACAGTGTTCAGCACAACCA 

11. TTTGAACGAAAACATTACGGTACCG 

12. GATCCGGTACCGTAATGTTTTCGTT 

15 

TABLE IX 
ECEPO SECTION 1 

Xba l 

EcoRI 1 . 3 

AATTCT AG AAACCATGAG GGTAATAAAA TAWTGGCTCC GCCGCGTCTG 
GATC TTTGGTACTC CCATTATTTT ATTACqGAGG CGGCGCAGAC 
20 2 4 

atctgcgac It cgaga gTtct ggaacgttac ctgctg|gaag CTAAAGAAGC 

TAGACGCTGA GCTCjTCAAGA .CCTTGCAATG GACGACCTTCj GATTTCTTCG 

TGAAAACATC JaCCACTGGTT GTGCTGAACA CTGTTC tTTTG AACGAAAACA 
ACTTTTGTAG TGGTGACCAA CACGACTTGT GACAAGAAACJ TTGCTTTTGT 
25 8 10 

Kpn l Bam HI 
TTACGGTACC G 
AATGCCATGG CCTAG 
12 



WO 85/02610 PCT/US84/02021 

- 68 - 

TABLE X 

ECEPO SECTION 2 OLIGONUCLEOTIDES 

1. AATTCGGTACCAGACACCAAGGT 

2. GTTAACCTTGGTGTCTGGTACCG 
5 3. . TAACTTCTACGCTTGGAAACGTAT 

4. TTCCATACGTTTCCAAGCGTAGAA 

5. GGAAGTTGGTCAACAAGCAGTTGAAGT 

6. CCAAACTTCAACTGCTTGTTGACCAAC 

7. TTGGCAGGGTCTGGCACTGCTGAGCG 
10 8. GCCTCGCTCAGCAGTGCCAGACCCTG 

9. AGGCTGTACTGCGTGGCCAGGCA 

10. GCAGTGCCTGGCCACGCAGTACA 

11. CTGCTGGTAAACTCCTCTCAGCCGT 

12. TTCCCACGGCTGAGAGGAGTTTACCA 
15 13. GGGAACCGCTGCAGCTGCATGTTGAC 

. 14. GCTTTGTCAACATGCAGCTGCAGCGG 

15. AAAGCAGTATCTGGCCTGAGATCTG 

16. GATCCAGATCTCAGGCCAGATACT 



20 



25 
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TABLE XII 

ECEPO SECTION 3 

1. GATCCAGATCTCTGACTACTCTGC 

5 2. ACGCAGCAGAGTAGTCAGAGATCTG 

3. TGCGTGCTCTGGGTGCACAGAAAGAGG 

4. GATAGCCTCTTTCTGTGCACCCAGAGC 

5. CTATCTCTCCGCCGGATGCTGCATCT 

6. CAGCAGATGCAGCATCCGGCGGAGA 
10 7. GCTGCACCGCTGCGTACCATCACTG 

8. ATCAGCAGTGATGGTACGCAGCGGTG 

"9. CTGATACCTTCCGCAAACTGTTTCG 

10. ATACACGAAACAGTTTGCGGAAGGT 

11. TGTATACTCTAACTTCCTGCGTGGTA 
15 12. CAGTTTACCACGCAGGAAGTTAGAGT 

13. AACTGAAACTGTATACTGGCGAAGC 

14. GGCATGCTTCGCCAGTATACAGTTT 

15. ATGCCGTACTGGTGACCGCTAATAG 

16. TCGACTATTAGCGGTCACCAGTAC 

20 



25 
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TABLE XIII 
ECEPO SECTION 3 



BamH I Bgl ll 
GA TCCAGATCTCTG 
GTCTAGAGAC 



ACTACTCTGC [TGCGT GCTCT GGGTGCACAG AAAGAGG pTA TCT CTCCGCC 
TGATGAGACG ACGCAlCGAGA CCCACGTGTC TTTCTCCGAT AGIAGAGGCGG 
2 4 1 



GGATGCTGCA 
CCTACGACGT 
10 6 



u 2 , . £ 

TCT PCT.GC AC CGCTGCGTAC CATCACT GpT GAT ACCTTCC 
AGACGACjGTG GCGACGCATG GTAGTGACGA CT^TGGAAGG 



11 . 13 

GCAAACTGTT TCG tTGTATA C TCTAACTTCC TGCGTGGTA IA ACTGA AACTG 

CGTTTGACAA AGCACATAITG AGATTGAAGG ACGCACCATT TGACfTTTGAC 
10 12 



15 Sai l 
TATACTGGCG AAGChjGCCG TACTGGTGAC CGCTAATAG 
ATATGACCGC TTCGTACGQC ATGACCACTG GCGATTATC AGCT 
15 14 16 



20 



25 
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TABLE XIV 



ECEPO GENE 



Xba l 
CTAG 



AAACCATGAG 
TTTGGTACTC 



GGTAATAAAA 
CCATTATTTT 



-1 1 
MetAla 
TAATGGCTCC 
ATTACCGAGG 



GCCGCGTCTG 
CGGCGCAGAC 



5 



ATCTGCGACT CGAGAGTTCT GGAACGTTAC CTGCTGGAAG CTAAAGAAGC 
TAGACGCTGA GCTCTCAAGA CCTTGCAATG GACGACCTTC GATTTCTTCG 



TGAAAACATC ACCACTGGTT GTGCTGAACA CTGTTCTTTG AACGAAAACA 
ACTTTTGTAG TGGTGACCAA CACGACTTGT GACAAGAAAC TTGCTTTTGT 



10 TTACGGTACC AGACACCAAG GTTAACTTCT ACGCTTGGAA ACGTATGGAA 
AATGCCATGG TCTGTGGTTC CAATTGAAGA TGCGAACCTT TGCATACCTT 



GTTGGTCAAC AAGCAGTTGA AGTTTGGCAG GGTCTGGCAC TGCTGAGCGA 
CAACCAGTTG TTCGTCAACT TCAAACCGTC CCAGACCGTG ACGACTCGCT 



GGCTGTACTG CGTGGCCAGG CACTGCTGGT AAACTCCTCT CAGCCGTGGG 

. CCGACATGAC GCACCGGTCC GTGACGACCA TTTGAGGAGA GTCGGCACCC 

15 

AACCGCTGCA GCTGCATGTT GACAAAGCAG TATCTGGCCT GAGATCTCTG 

TTGGCGACGT CGACGTACAA CTGTTTCGTC ATAGACCGGA CTCTAGAGAC 



ACTACTCTGC 'tGCGTGCTCT GGGTGCACAG AAAGAGGCTA TCTCTCCGCC 
TGATGAGACG ACGCACGAGA CCCACGTGTC TTTCTCCGAT AGAGAGGCGG 



20 GGATGCTGCA TCTGCTGCAC CGCTGCGTAC CATCACTGCT GATACCTTCC 
CCTACGACGT AGACGACGTG GCGACGCATG GTAGTGACGA CTATGGAAGG 



GCAAACTGTT TCGTGTATAC TCTAACTTCC TGCGTGGTAA ACTGAAACTG 
CGTTTGACAA AGCACATATG AGATTGAAGG ACGCACCATT TGACTTTGAC 

Sail 

TATACTGGCG AAGCATGCCG TACTGGTGAC CGCTAATAG 
ATATGACCGC TTCGTACGGC ATGACCACTG GCGATTATCA GOT 

25 
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More particularly, Table VIII illustrates oligo- 
nucleotides employed to generate the Section 1 of the 
ECEPO gene encoding amino terminal residues of the human 
species polypeptide. Oligonucleotides were assembled 
5 into duplexes (1, and 2, 3 and 4, etc. 3 and the duplexes 
were then ligated to provide ECEPO Section 1 as in Table 
IX. Note that the assembled section includes respective 
terminal' EcoRI and Bam HI sticky ends, that "downstream" 
of the Eco RI sticky end is a Xba l restriction enzyme 

10 recognition site; and that "upstream" of the Bam HI sticky 
end is a Kpn l recognition site. Section 1 could readily 
be amplified using the M13 phage vector employed for 
verification of sequence of the section. Some dif- 
ficulties were encountered in isolating the section as an 

15 Xba l /Kpn l f ragment • f rom RF DNA generated in E .coli , 

likely due to methylation of the Kpn l recognition sitjs 
bases within the host. Single-stranded phage DNA was 
therefore isolated and rendered into double-stranded form 
in vitro by primer extension and the desired double- 

20 stranded fragment was thereafter readily isolated. 

ECEPO gene Sections 2 and 3 (Tables XI and XIII) 
were constructed in a similar manner from the oligo- 
nucleotides of Tables X and XII, respectively. Each 
section was amplified in the M1J vector employed for 

25 sequence verification and was isolated from phage DNA. 
As is apparent from Table XI, ECEPO Section 2 was con- 
structed with Eco RI and Bam HI sticky ends and could be 
isolated as a Kpn I /BqI II fragment. Similarly, ECEPO 
•Section 3 was prepared with BamH I and Sai l sticky ends 

30 and could be isolated from phage RF DNA as a Bql ll /Sal l 
fragment. The three sections thus prepared can readily 
be assembled into a continuous DNA sequence (Table XIV) 
encoding the entire human species EPO polypeptide with an 
amino terminal methionine codon (ATG) for E . coli transla- 

35 tion initiation. Note also that "upstream" of the ini- 
tial ATG is a series of base pairs substantially 
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duplicating the ribosome binding site sequence of the 
highly expressed OMP-f gene of E>coll . 

Any suitable expression vector may be employed 
to carry the ECEPO. The particular vector chosen for 
5 expression of the ECEPO gene as the H temperature sen- 
sitive" plasmid pCFM536 — a derivative of plasmid 
pCFM4l4 (A.T.C.C. 40076) as described in co-pending 
U.S. Patent Application Serial No. 636,727, filed August 
6, .1984, by Charles F. Morris. More specifically, 

10 pCFM536 was digested with Xbal and Hind lll ; the large 

fragment was isolated and employed in a two-part ligation 
with the ECEPO gene. Sections 1 ( Xba l /Kpn l ) 2 
C Kpn I /Bgl ll ) and 3 ( Bgl ll /Sal l) had previously been 
assembled in the correct order in M13 and the EPO gene 

15 was isolated therefrom as a single Xba l /Hind lll fragment. 
This fragment included a portion of the polylinker from 
M13 mp9 phage spanning the Sai l to Hind lll sites therein. 
Control of expression in the resulting expression 
plasmid, £536, was by means of a lambda P^ promoter, 

20 which itself may be under control of the C IQ57 repressor 
gene (such as provided in E .coli strain K12AHtrp). 

The manufactured ECEPO gene above may be 
variously modified to encode erythropoietin analogs such 
as [Asn 2 , des-Pro 2 through Ile 6 ]hEP0 and [His 7 ]hEPQ, as 

25 described below. 

A. [Asn 2 , des-Pro 2 through Ile 6 ]hEP0 

. Plasmid 536 carrying the ECEPO manufactured gene 
of Table XIV as* a Xba l to- Hind lll insert was digested - 
30 with Hind lll and Xhol. The latter endonuclease cuts the 
ECEPO gene at a unique, 6 base pair recognition site 

Q 

spanning the last base of the codon encoding Asp through 
the second base of the Arg 10 codon. A Xba l /Xho l "linker" 
sequence was manufactured having the following sequence: 
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Xbal +1 2 7 8 9 

Met Ala Asn Cys Asp Xho l 
S'-CTAG ATG GCT AAT TGC GAC-3' 

3 , -TAC CGA TTA ACG CTG AGCT-5* 

The Xbal/Xhol linker and the Xhol /Hind lll ECEPO 
5 gene sequence fragment were inserted into the large 
fragment resulting from Xba l and Hindlll digestion of 
plasmid pCFM526 — a derivative of plasmid pCFM4l4 
(A.T.C.C. 40076) — as described in co-pending 
U.S. Patent Application Serial No. 636,727, filed August 
10 6, 1984, by Charles F. Morris, to generate a plasmid- 
borne DNA sequence encoding E.coli expression of the 
Met" 1 form of the desired analog. 

B. [His 7 ] hEPO 

15 Plasmid 536 was digested with Hindlll and Xho l 

as in part A above. A Xba l /Xho l linker was manufactured 
having the following sequence: 

Xba l +1 23456789 Xho l 

Met Ala Pro Pro Arg Leu He His Asp 
20 5'-CTAG ATG GCT CCG CCA CGT CTG ATC CAT GAC-3 

3'-TAC CGA GGC GGT GCA GAC TAG GTA CTG AGCT-5 

The linker and the Xho l /Hind lll ECEPO sequence 
fragment were then inserted ipto pCFM526 to generate a 
plasmid-borne DMA sequence encoding E .coli expression of 

25 the Met" 1 'form of the desired analog. 

Construction of a manufactured gene ("SCEPO 11 ) 
incorporating ■ yeast preference codons is as. described in 
the following Tables XV through XXI. As was the case 
with the ECEPO gene, the entire construction involved 

30 formation of three sets of oligonucleotides (Tables XV, 
XVII and XIX) which were formed into duplexes and 
assembled into sections (Tables XVI, XVIII and XX). Note 
that synthesis was facilitated in part by use of some 
sub-optimal codons in both the SCEPO and ECEPO construe- 
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tions, i.e., oligonucleotides 7-12 of Section 1 of both 
genes were identical, as were oligonucleotides 1-6 of 
Section 2 in each gene. 



10 



15 



20 



25 



30 



WO 85/02610 



PCT/US84/02021 



- 77 - 
TABLE XV 

SCEPO SECTION 1 OLIGONUCLEOTIDES 

1. AATTCAAGCTTGGATAAAAGAGCT 

5 2. GTGGAGCTCTTTTATCCAAGCTTG 

3. CCACCAAGATTGATCTGTGACTC 

4. TCTCGAGTCACAGATCAATCTTG 

5. GAGAGTTTTGGAAAGATACTTGTTG 

6. CTTCCAACAAGTATCTTTCCAAAAC 
10 7. GAAGCTAAA'GAAGCTGAAAACATC 

8. GTGGTGATGTTTTCAGCTTCTTTAG 

9. ACCACTGGTTGTGCTGAACACTGTTC 

10. CAAAGAACAGTGTTCAGC ACAACCA 

11. TTTGAACGAAAACATTACGGTACCG 
15 12. GATCCGGTACCGTAATGTTTTCGTT 

TABLE XVI 
SCEPO SECTION 1 

Eco RI Hindi 1 1 1. 
AATTCA AGCTTGGATA 
20 GT TCGAACCTAT 

2 

AAAGAGCT bc ACC AAgItTG ATCTGTGACT c|sAGAGTTTT ■ 
TTTCTCGAGG TGCTTCTAAC TAGACACf GA GCTCTJCAAAA 

4 

5 . 7 
GGAAAGATAC TTGTTGEAAG CTAAAGAAGC TGAAAACATC feCCACTGGTT 
CCTTTCTATG AACAACCTTCj GATTTCTTCG ACTTTTGTAG TGGTGjACCAA 

6 1 8 

9 , 11 Kpn l Bam HI 

GTGCTGAACA CTGTTc trTTG AACGAAAACA TTACGGTACC G 
CACGACTTGT GACAAGAAACl TTGCTTTTGT AATGCCATGG CCTAG 

! 12 



25 
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TABLE XVII 

SCEPO SECTION 2 OLIGONUCLEOTIDES 

1. AATTCGGTACCAGACACCAAGGT 

5 2. GTTAACCTTGGTGTCTGGTACCG 

3. TAACTTCTACGCTTGGAAACGTAT 

4. TTCCATACGTTTCCAAGCGTAGAA 

5. GGAAGTTGGTCAACAAGCAGTTGAAGT 

6. CCAAACTTCAACTGCTTGTTGACCAAC 
10 7. TTGGCAAGGTTTGGCCTTGTTATCTG 

8. GCTTCAGATAACAAGGCCAAACCTTG 

"9. AAGCTGTTTTGAGAGGTCAAGCCT 

10. AACAAGGCTTGACCTCTCAAAACA 

. 11. TGTTGGTTAACTCTTCTCAACCATGGG 

15 12. TGGTTCCCATGGTTGAGAAGAGTTAACC 

13. AACCATTGCAATTGCACGTCGAT 

14. CTTTATCGACGTGCAATTGCAA 

15. AAAGCCGTCTCTGGTTTGAGATCTG 

16. GATCCAGATCTCAAACCAGAGACGG 

20 



25 
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TABLE XVIII 
SCEPO SECTION 2 



Kpn l 
Eco RI 1 

TTTTttcggtacc AGACACCAAG 
5 GCCATGG TCTGTGGTTC 

2 



, I 5 

GT lTAACT TCT ACGCTTGGAA ACGTAT GGAA GTTGGTCAAC AAGCTGTTGA 
CAATTG^AGA TGCGAACCTT TGCATACCTTf CAACCAGTTG TTCGACAACT 



4 6 
1 i 2 

AGT lTTGGC AA ggtttggcct tgttatctg r agc tgttttg agaggtcaag 

TCAAACTlGTT CCAAACCGGA ACAATAGACT TCGlACAAAAC TCTCCAGTTC 

8 ' 10 

ii u 

GCT ITGTT GGT TAACTCTTCT CAACCATGGG hACCATTGCA ATTGCACGTC 
GGAACTAlCCA ATTGAGAAGA GTTGGTACCC TTGGTIAACGT TAACGTGCAG 

ii M 

15 Bflll I Bam HI 

gat Kaagc cg TCTCTGGTTT GAGATCTG 
15 CTATTTCGGC AGAGACCAAA CTCTAGACCTA G 

16 



20 



25 
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TABLE XIX 

SCEPO SECTION 3 OLIGONUCLEOTIDES 

1. GATCCAGATCTTTGACTACTTTGTT 

5 2. TCTCAACAAAGTAGTCAAAGATCTG 

3. GAGAGCTTTGGGTGCTCAAAAGGAAG 

4. ATGGCTTCCTTTTGAGCACCCAAAGC 

5. CCATTTCCCCACCAGACGCTGCTT 

6. GCAGAAGCAGCGTCTGGTGGGGAA 
10 7. CTGCCGCTCCATTGAGAACCATC 

8. CAGTGATGGTTCTCAATGGAGCG 

"9. ACTGCTGATACCTTCAGAAAGTT 

10. GAATAACTTTCTGAAGGTATCAG 

11. ATTCAGAGTTTACTCCAACTTCT 
15 12. CTCAAGAAGTTGGAGTAAACTCT 

13. TGAGAGGTAAATTGAAGTTGTACAC 

14. ACCGGTGTACAACTTCAATTTACCT 

15. CGGTGAAGCCTGTAGAACTGGT 

16. CTGTCACCAGTTCTACAGGCTTC 
20 17. GACAGATAAGCCCGACTGATAA 

18. GTTGTTATCAGTCGGGCTTAT 

19. CAACAGTGTAGATGTAACAAAG 

20. TCGACTTTGTTACATCTACACT 



25 
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TABLE XX 
SCEPO SECTION 3 



BamHI Bqlll 1 , 
GATC CAGATCTTTG ACTACTTTGT T pAGAG CTTT 
GTCTAGAAAC TGATGAAACA ACTCTpGAAA 
5 2 

3 I 
GGGTGCTCAA AAGGAAG bCA TT TCCCCACC AGACGCTGCT T CTGCC GCTC 
CCCACGAGTT TTCCTTCGGT~A^AGGGGTGG TCTGCGACGA AGACGGCGAG 
4 6 

7 9 , 11 

CATTGAGAAC CATChCTGCT GATACCTTCA GAAAGTT hTT CA GAGTTTAC 
GTAACTCTTG GTAGTSAcTsA CTATGGAAGT CTTTCAATAA G|TCTCAAATG 
10 8 10 12 

13 15 

TCCAACTTCT frGAGA GGTAA ATTGAAGTTG TACAC CGGT G AAGCCTGTAG 

AGGTTGAAGA ACTcfTCCATT TAACTTCAAC ATGTGGCCAC TTCGGACATC 

1 14 ii 

17 19 

AACTGGT bAC AGA TAAGCCC GACTGATAA C AACA GTGTAG 

TTGACCACT^TclTATTCGGG CTGACTATTG TTGjTCACATC 
15 1 18 

Sai l 

ATGTAACAAA G 
TACATTGTTT CAGCT 
20 



20 



25 
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TABLE XXI 



SCEPO GENE 



Hindlll 



-1 +1 
ArgAla 



AGCTTGGATA AAAGAGCTCC ACCAAGATTG ATCTGTGACT CGAGAGTTTT 
ACCTAT TTTCTCGAGG TGGTTCTAAC TAGACACTGA GCTCTCAAAA 



GGAAAGATAC TTGTTGGAAG CT-AAAGAAGC TGAAAACATC ACCACTGGTT 
CCTTTCTATG AACAACCTTC GATTTCTTCG ACTTTTGTAG TGGTGACCAA 



GTGCTGAACA CTGTTCTTTG AACGAAAACA TTACGGTACC AGACACCAAG 
CACGACTTGT GACAAGAAAC TTGCTTTTGT AATGCCATGG TCTGTGGTTC 



10 GTTAACTTCT ACGCTTGGAA ACGTATGGAA GTTGGTCAAC AAGCTGTTGA 
CAATTGAAGA TGCGAACCTT TGCATACCTT CAACCAGTTG TTCGACAACT 



A'GTTTGGCAA GGTTTGGCCT TGTTATCTGA AGCTGTTTTG AGAGGTCAAG 
TCAAACCGTT CCAAACCGGA ACAATAGACT TCGACAAAAC TCTCCAGTTC 



CCTTGTTGGT TAACTCTTCT CAACCATGGG AACCATTGCA ATTGCACGTC 

GGAACAACCA ATTGAGAAGA GTTGGTACCC TTGGTAACGT TAACGTGCAG 

15 

GATAAAGCCG TCTCTGGTTT GAGATCTTTG ACTACTTTGT TGAGAGCTTT 

CTATTTCGGC AGAGACCAAA CTCTAGAAAC TGATGAAACA ACTCTCGAAA 



GGGTGCTCAA AAGGAAGCCA TTTCCCCACC AGACGCTGCT TCTGCCGCTC 
CCCACGAGTT TTCCTTCGGT AAAGGGGTGG TCTGCGACGA AGACGGCGAG 



20 CATTGAGAAC CATCACTGCT GATACCTTCA GAAAGTTATT CAGAGTTTAC 
GTAACTCTTG GTAGTGACGA CTATGGAAGT CTTTCAATAA GTCTCAAATG 



TCCAACTTCT TGAGAGGTAA ATTGAAGTTG TACACCGGTG AAGCCTGTAG 
AGGTTGAAGA ACTCTCCATT TAACTTCAAC ATGTGGCCAC TTCGGACATC 



AACTGGTGAC AGATAAGCCC GACTGATAAC AACAGTGTAG 
TTGACCACTG TCTATTCGGG CTGACTATTG TTGTCACATC 



25 



ATGTAACAAA 
TACATTGTTT 



Sai l 

G 

CAGCT 
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The assembled SCEPO sections were sequenced in 
M13 and Sections 1, 2 and 3 were isolatable from the 
phage as Hind lll/Kpnl , KpnI/BqIII, and Bglll/Sall frag- 
ments. 

5 The presently preferred expression system for 

SCEPO gene products is a secretion system based on 
S.cerevisiae o-f actor secretion, as described in co- 
pending U.S. Patent Application Serial No. 487,753, filed 
April 22, 1983, by Grant A. Bitter, published October 31, 

10 1984 as European Patent Application 0 123,294. Briefly 
put, the system involves constructions wherein DNA 
encoding the leader sequence of the yeast a-factor gene 
product is positioned immediately 5 9 to the coding region 
of the exogenous gene to be expressed. As a result, the 

15 gene product translated includes a leader or signal 

sequence which is "processed off" by an endogenous yeast 
enzyme in the course of secretion of the remainder of the 
product. Because the construction makes use of the ct- 
factor translation initiation (ATG) codon, there was no 

20 need to provide such a codon at the -1 position of the 
SCEPO gene. As may be noted from Table XXI, the alanine 
(+1) encoding sequence is preceded by a linker sequence 
allowing for direct insertion into a plasmid including 
the ONA for the first 80 residues of the a-factor leader 

25 following the a-factor promoter. The specific preferred 
construction for SCEPO gene expression involved a four- 
part ligation including the above-noted SCEPO section 
fragments and the large fragment of Hind lll /Sal l 
digestion of plasmid paC3. From the resulting plasmid 

30 paC3/SCEP0, the a-factor promoter and leader sequence and 
SCEPO gene were isolated by digestion with Bam HI and 
ligated into Bam HI digested plasmid pYE to form 
expression plasmid pYE/SCEPO. 

35 EXAMPLE 12 



The present example relates to expression of 
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recombinant products of the manufactured ECEPO and SCEPO 
genes within the expression systems of Example 11. 

In use of the expression system designed for use 
of E.coll host cells, plasmid p536 of Example 11 was 
5 transformed into AM7 E .coli cells previously transformed 
with a suitable plasmid, pMWl, harboring a C IQ57 gene. 
Cultures of cells in LB broth (Anvpicillin 50 ug/ml and 
kanamycin 5 yg/ml, preferably with 10 mM MgSO^) were 
maintained at 28*C and upon growth of cells in culture to 

10 O.D.g 00 = 0.1, EPO expression was induced by raising the 
culture temperature to 42*C. Cells grown to about 40 
O.D. provided EPO production (as estimated by gel) of 
about 5 mg/OD liter. 

Cells were harvested, lysed, broken with French 

15 Press (10,000 psi) and treated with lysozyme and NP-40 

detergent. The pellet resulting from 24,000 xg centrifu- 
gation was solubilized with guanidine HCl and subjected 
to further purification in a single step by means 'of 
C 4 (Vydac) Reverse Phase HPLC (EtOH, 0-80S, 50 mM NH^Ac, 

20 pH 4.5). Protein sequencing revealed the product to be 
greater than 95% pure and the products obtained revealed 
two different amino terminals, A-P-P-R... and P-P-R... in 
a relative quantitative ratio of about 3 to 1. This 
latter observation of hEPO and [des Ala 1 ] hEPO products 

25 indicates that amino terminal "processing" within the 
host cells serves to remove the terminal methionine and 
in some instances the initial alanine. Radioimmunoassay 
activity for the isolates was at a level of .150,000 to 
160,000' U/mg; in vitro assay activity was at a level of. 

50 30,000 to 62,000 U/mg ; and in vivo assay activity ranged 
from about 120 to 720 U/mg. (Cf., human urinary isolate 
standard of 70,000 U/mg in each assay.) The dose response 
curve for the recombinant product in the iii vivo assay 
differed markedly from that of the human urinary EPO 

35 standard. 



WO bo/02610 PCT/US84/02021 

- 85 - 

The EPO analog plasmids formed in parts A and 8 
of Example 11 were each transformed into pMWl-transf ormed 
AM7 E . coli cells and the cells were cultured as above. 
Purified isolates were tested in both RIA and in vitro 
5 assays. RIA and in^ vitro assay values for [Asn 2 , 
des-Pro 2 through lie 6 ] hEPO expression products were 
approximately 11,000 U/mg and 6,000 U/mg protein, respec- 
tively, while the. assay values for [His 7 ] hEPO were about 
41,000 U/mg and 14,000 U/mg protein, respectively, indi- 

10 eating that the analog products were from one-fourth to 
one-tenth as "active" as the "parent" expression product 
in the assays. 

In the expression system designed for use of 
S.cerevisiae host cells, plasmid pYE/SCEPO was trans- 

15 formed into two different strains, YSDP4 (genotype a 
pep4-3 trpl ) and RK81 (genotype oca pep4-3 trpl ) . 
Transformed YSDP4 hosts were grown in SD medium (Methods 
in Yeast Genetics, Cold Spring Harbor Laboratory, Cold 
Spring Harbor, N.Y., p. 62 (1983) supplemented with casa- 

20 mino acids at 0.5%, pH 6.5 at 30*C. Media harvested when 
the cells had been grown to 36 O.D. contained EPO pro- 
ducts at leveis of about 244 U/ml (97 ug/OD liter by 
RIA). Transformed RK81 cells grown to either 6.5 O.D. or 
60 O.D. provided media with EPO concentrations of about 

25 80-90 U/ml (34 ug/OD liter by RIA). Preliminary analyses 
reveal significant heterogeneity in products produced by 
the expression system, likely to be due to variations in 
glycosylation of proteins expressed , and relatively high 
mannose content of the- associated carbohydrate. 

30 Plasmids PaC3 and pYE in HB101 E .coli cells were 

deposited in accordance with the Rules of Practice of the 
U.S. Patent Office on September 27, 1984, with the 
American Type Culture Collection, 12301 Parklawn Drive, 
Rockville, Maryland, under deposit numbers A.T.C.C. 39881 

35 and A.T.C.C. 39882, respectively. Plasmids pCFM526 in 
AM7 cells, pCFM536 in 0M103 cells, and pMWl in JM103 
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cells were likewise deposited on November 21, 1984 as 
A.T.C.C. 33932, 33934, and 33933, respectively. 
Saccharomyces cerevisiae strains YSPD4 and RK81 were 
deposited on November 21, 1984 as A.T.C.C. 20734 and 
5 20733, respectively. 

It should be readily apparent from consideration 
of the above illustrative examples that numerous excep- 
tionally valuable products and processes are provided by 
the present invention in its many aspects. 

10 Polypeptides provided by the invention are 

conspicuously useful materials, whether they are micro- 
bially expressed products or synthetic products, the pri- 
mary, secondary or tertiary structural conformation of 
which was first made known by the present invention. 

15 As previously indicated, recombinant-produced 

and synthetic products of the invention share, to varying 
degrees, the iji vitro biological activity of EPO isolates 
from natural sources and consequently are projected to 
have utility as substitutes for EPO isolates in culture 

20 media employed for growth of erythropoietic cells in 

culture. Similarly, to the extent that polypeptide pro- 
ducts of the invention share the ^in vivo activity of 
natural EPO isolates they are conspicuously suitable , for 
use in erythropoietin therapy procedures practiced on 

25 mammals, including humans, to develop any or all of the 
effects herefore attributed in vivo to EPO, e.g., stimu- 
lation of reticulocyte response, development of ferroki- 
netic effects fsuch as plasma iron turnover effects and 
marrow, transit time effects), erythrocyte mass changes, 

30 stimulation of hemoglobin C synthesis (see, Eschbach, et 
al., supra ) and, as indicated in Example 10, increasing 
hematocrit levels in mammals. Included within the class 
of humans treatable with products of the invention are 
patients generally requiring blood transfusions and 

35 including trauma victims, surgical patients, renal 
disease patients including dialysis patients, and 
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patients with a variety of blood composition affecting 
disorders, such as hemophilia, sickle cell disease, phy- 
siologic anemias, and the like. The minimization of the 
need for transfusion therapy through use of EPO therapy 
5 can be expected to result in reduced transmission of 

infectious agents. Products of the invention, by virtue 
of their production by recombinant methods, are expected 
to be free of pyrogens, natural inhibitory substances, 
and the like, and are thus likely to provide enhanced 

10 overall effectiveness in therapeutic processes vis-a-vis 
naturally derived products. Erythropoietin therapy with 
products of the present invention is also expected to be 
useful in the enhancement of oxygen carrying capacity of 
individuals encountering hypoxic environmental conditions 

15 and possibly in providing beneficial cardiovascular 
effects. 

A preferred method for administration of poly- 
peptide products of the invention is by parenteral (e.g., 
IV, IM, SC, or IP) routes and the compositions admi- 

20 nistered would ordinarily include therapeutically 

effective amounts of product in combination with accep- 
table diluents, carriers and/or adjuvants. Preliminary 
pharmacokinetic studies indicate a longer half-life in 
vivo for monkey EPO products when administered IM rather 

25 than IV. Effective dosages are expected to vary substan- 
tially depending upon the condition treated but thera- 
peutic doses are presently expected to be in the range of 
0.1 (~7U) to 100 (-700011) yg/kg body weight of the active 
* material. Standard diluents such as human serum albumin 

30 are contemplated for pharmaceutical compositions of the 
invention, as are standard carriers such as saline. 

Adjuvant materials suitable for use in com- 
positions of the invention include compounds indepen- 
dently noted for erythropoietic stimulatory effects, such 

35 as testosterones , progenitor cell stimulators, 

insulin-like growth factor, prostaglandins, serotonin, 
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cyclic AMP, prolactin and triiodothyronine, as well as 
agents generally- employed in treatment of aplastic ane- 
mia, such as methenolene, stanozolol and nandrolone [see, 
e.g., Resegotti, et al., Panminerva Medica , 23, , 243-248 
5 (1981); McGonigle, et al., Kidney Int. , 25(2) , 437-444 
(1984); Pavlovic-Kantera, et al., Expt .Hematol . , 8(Supp. 
8), 283-291 C1980); and Kurtz, FEBS Letters, I4a(l) , 
105-108 (1982)] . Also contemplated as adjuvants are 
substances reported to enhance the effects of, or 

10 synergize, erythropoietin or asialo-EPO, such as the 

adrenergic agonists, thyroid hormones, androgens and BPA 
[see, Dunn, "Current Concepts in Erythropoiesis" , John • 
Wiley and Sons (Chichester, England, 1983); Weiland, et 
al., Blut, 44(3) , 173-175 (1982); Kalmanti, Kidney Int. , 

15 22, 383-391 (1982); Shahidi, New .Eng. J .Med . , 289 , 72-80 
(1973); Fisher, et al., Steroids , 30(6) , 833-845 (1977); 
Urabe, et al., J. Exp. Med. , 149 , 1314-1325 (1979); and 
Billat, et al., Expt .Hematol . , 10(1) , 133-140 (1982)] as 
well as the classes of compounds designated "hepatic 

20 erythropoietic factors" [see, Naughton, et al., 

Acta.Haemat. , 69 , 171-179 (1983)] and " erythrotropins" 
[as described* by Congote, et al . in Abstract 364, 
Proceedings 7th International Congress of Endocrinology 
(Quebec City, Quebec, July 1-7, 1984); Congote, 

25 Biochem.Biophys. Res .Comm. , 115(2) , 447-483 (1983) and 
Congote, Anal . Biochem . , 140 , 428-433 (1984)] and 
"erythrogenins" [as described in Rothman, et al., 
J.Surg. Oncol . , 20 , 105-108- ( 1982 )] . Preliminary 
screenings designed to measure erythropoietic responses 

30 of ex-hypoxic polycythemic mice pre-treated with either 
5-a-dihydrotestosterone or nandrolone and then given 
erythropoietin of the present invention have generated 
equivocal results . 

Diagnostic uses of polypeptides of the invention 

35 are similarly extensive and include use in labelled and 
unlablled forms in a variety of immunoassay techniques 
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including RIA's, ELISA's and the like, as well as a 
variety of in vitro and iji vivo activity assays. See, 
e.g., Dunn, et al., Expt .Hematol . , 11(7) , 590-600 (1983); 
Gibson, et al., Pathology , 16, 155-156 (1984); Krystal, 
5 Expt. Hematol. , 11(7) , 649-660 (1983); Saito, et al . , 
Jap. J.Med. , 23(1) , 16-21 (1984); Nathan, et al., 
New Eng. J.Med. , 308(9) , 520-522 (1983); and various 
references pertaining to assays referred to therein. 
Polypeptides of the invention, including synthetic pep- 

10 tides comprising sequences of residues of EPO first 
revealed herein, also provide highly useful pure 
materials for generating polyclonal antibodies and 
"banks 1 * of monoclonal antibodies specific for differing 
continuous and discontinuous epitopes of EPO. As one 

15 example, preliminary analysis of the amino acid sequences 
of Table VI in the context of hydropathicity according to 
Hopp, et al., P.N .A.S. (U.S.A.) , 78 , pp. 3824-3828 
(1981) and of secondary structures according to Chou, et 
al., Ann .Rev . Biochem . , 47, p. 251 (1978) revealed that 

20 synthetic peptides duplicative of continuous sequences of 
residues spanning positions 41-57 inclusive, 116-118 
inclusive and 144-166 inclusive are likely to produce a 
highly antigenic response and generate useful monoclonal 
and polyclonal antibodies immunoreact ive with both the 

25 synthetic peptide and the entire protein. Such antibo- 
dies are expected to be useful in the detection and affi- 
nity purification of EPO and EPO-related products. 

Illustratively, the following three synthetic 
peptides were prepared: 

30 

(1) hEPO 41-57, V-P-D-T-K-V-N-F-Y-A-W-K- 

R-M-E-V-G; 

(2) hEPO 116-128, K-E -A- I -S-P-P-D-A- A- S-A-A; 

(3) hEPO 144-166, V-Y-S-N-F-L-R-G-K-L-K-L-Y- 
35 T-G-E-A-C-R-T-G-D-R. 
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Preliminary immunization studies employing the above- 
noted polypeptides have revealed a relatively weak posi- 
tive response to hEPO 41-57, no appreciable response to 
hEPO 116-128, and a strong positive resopnse to hEPO 
5 144-166, as measured by capacity of rabbit serum antibo- 
dies to immunoprecipitate " I-labelled human urinary EPO 
isolates. Preliminary in vivo activity studies on the 
three peptides revealed no significant activity either 
alone or in combination. 

10 While the deduced sequences of amino acid resi- 

dues of mammalian EPO provided by the illustrative 
examples essentially define the primary structural con- 
formation of mature EPO, it will be understood that the 
specific sequence of 165 amino acid residues of monkey 

15 species EPO in Table V and the 166 residues of human spe- 
cies EPO in Table VI do not limit the scope of useful 
polypeptides provided by the invention. Comprehended by 
the present invention are those various naturally- 
occurring allelic forms of EPO which past research into 

20 biologically active mammalian polypeptides such as human 
Y interferon indicates are likely to exist. (Compare, 
e.g., the human immune interferon species reported to 
have an arginine residue at position No. 140 in EPO 
published application 0 077 670 and the species reported 

25 to have glutamine at position No. 140 in Gray, et al., 
Nature , 295 , pp. 5Q3-508 (1982). Both species are 
characterized as constituting "mature 11 human y interferon 
sequences.) Allelic forms of mature EPO polypeptides may 
vary from each other and from the sequences o.f Tables V 

30 and VI in terms of length of sequence and/or in terms of 
deletions, substitutions, insertions or additions of 
amino acids in the sequence, with consequent potential 
variations in the capacity for glycosylation . As noted 
previously, one putative allelic form of human species 

35 EPO is believed to include a methionine residue at posi- 
tion 126. Expectedly, naturally-occurring allelic forms 
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of EPO-encoding DNA genomic and cDNA sequences are also 
likely to occur which code for the above-noted types of 
allelic polypeptides or simply employ differing codons 
for designation of the same polypeptides as specified, 
5 In addition to naturally-occurring allelic forms 

of mature EPO, the present invention also embraces other 
"EPO products" such as polypeptide analogs of EPO and 
fragments of "mature" EPO . Following the procedures of 
the above-noted published application by Alton, et al. 

10 (WO/83/04053) one may readily design and manufacture 
genes coding for microbial expression of polypeptides 
having primary conformations which differ from that 
herein specified for mature EPO in terms of the identity 
or location of one or more residues (e.g., substitutions, 

15 terminal and intermediate additions and deletions). 

Alternately, modifications of cDNA and genomic EPO genes 
may be readily accomplished by well-known site-directed 
mutagenesis techniques and employed to generate analogs 
and derivatives of EPO. Such EPO products would share at 

20 least one of the biological properties of EPO but may 

differ in others. As examples, projected EPO products of 
the invention include those which are foreshortened by 
e.g., deletions [Asn 2 , des-Pro 2 through Ile 6 ]hEP0, 
[des-Thr 163 through Arg 166 ]hEP0 and " A27-55hEP0" , the 

25 latter having the residues coded for by an entire exon 
deleted; or which are more stable to hydrolysis (and, 
therefore, may have more pronounced or longer lasting 
effects than naturally-occurring EPO); or which have been 
altered to delete one or more a potential sites for gly- 

30 cosylation (which may result in higher activities for 
yeast-produced products); or which have one or more 
cystein residues deleted or replaced by, e.g., histidine 
or serine residues (such as the analog [His 7 ]hEP0) and 
are potentially more easily isolated in active form from 

35 microbial systems; or which have one or more tyrosine 
residues replaced by phenylalanine (such as the analogs 
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[Phe 15 ]hEP0, Ohe 49 ]hEP0, and [Phe 145 ] hEPO) and may bind 
more or less readily to EPO receptors on target cells. 
Also comprehended are polypeptide fragments duplicating 
only a part of the continuous amino acid sequence or 
5 secondary conformations within mature EPO, which 

fragments may possess one activity of EPO (e.g., receptor 
binding) and not others (e.g., erythropoietic activity). 
Especially significant in this regard are those potential 
fragments of* EPO which are elucidated upon consideration 

10 of the human genomic DNA sequence of Table VI, i.e., 

"fragments 1 ' of the total continuous EPO sequence which 
are delineated by intrcn sequences and which may consti- 
tute distinct '"domains" of biological activity. It is 
noteworthy that the absence of in vivo activity for any 

15 one or more of the w EP0 products" of the invention is not 
wholly preclusive of therapeutic utility (see, Weiland, 
et al., supra ) or of utility in other contexts, such as 
in EPO assays or EPO antagonism. Antagonists of erythro- 
poietin may be quite useful in treatment of polycythemias 

20 or cases of overproduction of EPO [see, e.g., Adamson, 
Hosp.Practice , 18C12) , 49-57 C1983), and Hellmann, et 
al., Clin.Lab.Haemat . . 5.» 335-342 (1983)]. 

According to another aspect of the present 
invention, the cloned DNA sequences described herein 

25 which encode human and monkey EPO polypeptides are 

conspicuously valuable for the information which they 
provide concerning the amino acid sequence of mammalian 
erythropoietin which has heretofore been unavailable 
despite decades of analytical processing of isolates^ of 

30 naturally-occurring products. The DNA sequences are also 
conspicuously valuable as products useful in effecting 
the large scale microbial synthesis of erthropoietin by a 
variety of recombinant techniques. Put another way, DNA 
sequences provided by the invention are useful in 

35 generating new and useful viral and circular plasmid DNA 
vectors, new and "useful transformed and transfected 
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microbial procaryotic and eucaryotic host cells 
(including bacterial and yeast cjells and mammalian cells 
grown in culture), and new and useful methods for 
cultured growth of such microbial host cells capable of 
5 expression of EPO and EPO products. DNA sequences of the 
invention are also conspicuously suitable materials for 
- use as labelled probes in isolating EPO and related pro- 
tein encoding cDNA and genomic DNA sequences of mammalian 
spepies other than human and monkey species herein speci- 

10 fically illustrated. The extent to which DNA sequences 
of the invention will have use in various alternative 
methods of protein synthesis (e.g., in insect cells] or 
in genetic therapy in humans and other mammals cannot yet 
be calculated, DNA sequences of the invention are 

15 expected to be useful in developing transgenic mammalian 
species which may serve as eucaryotic "hosts 1 * for produc- 
tion of erythropoietin and erythropoietin products in 
quantity. See, generally, Palmiter, et al., Science , 
222(4625) , 809-814 (1983). 

20 Viewed in this light, therefore, the specific 

disclosures of the illustrative examples are clearly not 
intended to be limiting upon the scope of the present 
invention and numerous modifications and variations are 
expected to occur to those skilled in the art. As one 

25 example,, while DNA sequences provided by the illustrative 
examples include cDNA and genomic DNA sequences, because 
this application provides amino acid sequence information 
essential -to manufacture of. DNA sequence, the invention 
also comprehends such, manufactured DNA sequences- as may 

30 be constructed based on knowledge of EPO amino acid 

sequences. These may code for EPO (as in Example 12) as 
well as for EPO fragments and EPO polypeptide analogs 
(i.e., "EPO Products") which may share one or more biolo- 
gical properties of naturally-occurring EPO but not share 

35 others (or possess others to different degrees). 

DNA sequences provided by the present invention 
are thus seen to comprehend all DNA sequences suitable 
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fdr use in securing expression in a procaryotic or 
eucaryotic host cell of a polypeptide product having at 
least a part of the primary structural conformation and 
one or more of the biological properties of erythro- 
5 poietin, and selected from among: (a] the DNA sequences 
set out in Tables V and VI; (b) DNA sequences which 
hybridize to the DNA sequences defined in (a) or 
fragments thereof; and (c) DNA sequences which, but for 
the degeneracy of the genetic code, would hybridize to 

10 the DNA sequences defined in (a) and (b). It is 

noteworthly in this regard, for example, that existing 
allelic monkey and human EPO gene sequences and other 
mammalian species gene sequences are expected to hybri- 
dize to the sequences of Tables V and VI or to fragments 

15 thereof. Further, but for the degeneracy of the genetic 
code, the SCEPO and ECEPO genes and the manufactured or 
mutagenized cDNA or genomic DNA sequences encoding 
various EPO fragments and analogs would also hybridize to 
the above-mentioned DNA sequences. Such hybridizations 

20 could readily be carried out under the hybridization con- 
ditions described herein with respect to the initial iso- 
lation of the monkey and human EPO-encoding DNA or more 
stringent conditions, if desired to reduce background 
hybridization . 

„25 In a like manner, while the above examples 

illustrate the invention of microbial expression of EPO 
products in the context of mammalian cell expression of 
- DNA inserted in a hybrid vector of bacterial plasmid and 
viral gen-amic-oxigins K a wide variety of expression 

30 systems are within the contemplation of the invention. 
Conspicuously comprehended are expression systems 
involving vectors of homogeneous origins applied to a 
variety of bacterial, yeast and mammlain cells in culture 
as well as to expression systems not involving vectors 

35 (such as calcium phosphate transfection of cells). In 
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this regard, it will be understood .that expression of, 
e.g., monkey origin DNA in monkey host cells in culture 
and human host cells in culture, actually constitute 
instances of "exogenous" DNA expression inasmuch as the 
5 EPO DNA whose high level expression is sought would not 
have its origins in the genome of the host. Expression 
systems of the invention further contemplate these prac- 
tices resulting in cytoplasmic formation of EPO products 
and accumulation of glycosylated and non-glycosylated EPO 

10 products in host cell cytoplasm or membrances Ce.g., 
accumulation in bacterial periplasmlc spaces) or in 
culture medium supernatants as above illustrated, or in 
rather uncommon systems such as P.aeruginosa expression 
systems (described in Gray, et al., Biotechnology , 2, pp. 

15 161-165 (1984)). 

Improved hybridization methodologies of the 
invention, while illustratively applied above to DNA/DNA 
hybridization screenings are equally applicable to 
RNA/RNA and RNA/DNA screening. Mixed probe techniques as 

20 herein illustrated generally constitute a number of 

improvements in hybridization processes allowing for more 
rapid and reliable polynucleotide isolations. These many 
individual processing improvements include: improved 
colony transfer and maintenance procedures; use of nylon- 

25 based filters such as GeneScreen and GeneScreen Plus to 
allow reprobing with same filters and repeated use of the 
filter, application of novel protease treatments 
•[compared, e.g., to Taub, et al. Anal .Biochem. , 126 , pp. 
222-230. (1982)] ; use of very low individual con- 

30 centrations (on the order of 0.025 picomole) of a large 
number of mixed probes (e.g., numbers in excess of 32); 
and, performing hybridization and post-hybridization 
steps under stringent temperatures closely approaching 
(i.e., within 4*C and preferably within 2*C away from) 

35 the lowest calculated dissocation temperature of any of 
the mixed probes employed. These improvements combine to 
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provide results which cauld not be expected to attend 
their use. This is amply illustrated by the fact that 
mixed probe procedures involving 4 times the number of 
probes ever before reported to have been successfully 
5 used in even cDNA screens on messenger RNA species of 

relatively low abundancy were successfully applied to the 
isolation of a unique sequence gene in a genomic library 
screening of 1,500,000 phage plaques. This feat was 
accomplished essentially concurrently with the publica- 
10 tion of the considered opinion of Anderson, et al., 
supra , that mixed probe screening methods were 
"...impractical for isolation of mammalian protein genes 
when corresponding RNA's are unavailable. 
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WHAT IS CLAIMED IS: 

1. A purified and isolated polypeptide having 
part or all of the primary structural conformation and 
5 one or more of the biological properties of naturally- 
occurring erythropoietin and characterized by being the 
product of procaryotic or eucaryotic. expression of an 
exogenous DNA sequence. 

10 2. A polypeptide according to claim 1 further 

characterized by being free of association with any mam- 
malian protein. 



3. A polypeptide 
15 the exogenous DNA sequence 

4. A polypeptide 
the exogenous DNA sequence 
sequence . 

20 

5. A polypeptide 
the exogenous DNA sequence 



according to claim 1 wherein 
is a cDNA sequence. 

according to claim 1 wherein 
is a manufactured DNA 



according to claim 1 wherein 
is a genomic DNA sequence. 



6. A polypeptide according to claim 1 wherein 
25 the exogenous DNA sequence is carried on an autonomously 

replicating circular DNA plasmid or viral vector. 

7. A polypeptide according to claim r 
possessing part or all of. the primary structural confor- 

30 mation of human erythropoietin as set forth in Table VI 
or any naturally occurring allelic variant thereof. 

8. A polypeptide according to claim 1 
possessing part or all of the primary structural confor- 

35 mation of monkey erythropoietin as set forth in Table V 
or any naturally occurring allelic variant thereof. 
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9. A polypeptide according to claim 1 which has 
the immunological properties of naturally-occurring 
erythropoietin, 

5 lO, A polypeptide according to claim 1 which 

has the in vivo biological activity of naturally- 
occurring erythropoietin. 

11.. A polypeptide according to claim 1 which 
10 has the i£ vitro biological activity of naturally- 
occurring erythropoietin. 

12. A polypeptide according to claim 1 further 
characterized by being covalently associated with a 

15 detectable label substance. 

13. A polypeptide according to claim 12 wherein 
said detectable label is a radiolabel. 

20 14. A DNA sequence for use in securing 

expression in a procaryotic or eucaryotic host cell of a 
polypeptide product having at least a part of the primary 
structural conformation and one or more of the biological 
properties of naturally-occurring erythropoietin, said 

25 ON A sequence selected from among: 

Ca] the DNA sequences set out in Tables V and 
VI or their . complementary strands; 

(b) DNA sequences which hybridize to the DNA 
sequences defined "in (a) or fragments thereof; and 

30 (c) DNA sequences which, but for the degeneracy 

of the genetic code, would hybridize to the DNA sequences 
defined in (a) and (b). 

15. A procaryotic or eucaryotic host cell 
35 transformed or transfected with a DNA sequence according 
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to claim 14 in a manner allowing the host cell to express 
said polypeptide product, 

16. A polypeptide product of the expression of 
5 a DNA sequence of claim 14 in a procaryotic or eucaryotic 

host . 

17. ' A purified and isolated DNA sequence coding 
for procaryotic or eucaryotic host expression of a poly- 
Id peptide having part or all of the primary structural con- 
formation and one or more of the biological properties of 
erythropoietin. 



15 



18. A cDNA sequence according to claim 17. 

19. A monkey species erythropoietin coding DNA 
sequence according to claim 18. 



20. A DNA sequence according to claim 19 and 
20 including the protein coding region set forth in Table V 

21. A genomic DNA sequence according to claim 

17. 

25 22. A human species erythropoietin coding DNA 

sequence according to claim 21. 

23. A DNA sequence according to claim 22 and 
including- the protein coding region set forth in Table 

30 VI. 

24. A manufactured DNA sequence according to 

claim 14. 

35 25. A manufactured DNA sequence according to 

claim- 24 and including one or more codons preferred for 
expression in E .coli cells- 
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26. A manufactured DNA sequence according to 
claim 25, coding for expression of human species erythro- 
poietin . 

5 27. A manufactured DNA sequence according to 

claim 26 including the protein coding region set forth in 
Table XIV. 

28. A manufactured DNA sequence according to 
10 claim 24 and including one or more codons preferred for 

expression in yeast cells, 

29. A manufactured DNA sequence according to 
claim 28, coding for expression of human species erythro- 

15 poietin. 

30. A manufactured DNA sequence according to 
claim 29 including the protein coding region set forth in 
Table XXI. 

20 

31. A DNA sequence according to claim 17 cova- 
lently associated with a detectable label substance. 

32. A DNA sequence according to claim 31 
25 wherein the detectable label is a radiolabel. 

33. A single-strand DNA sequence according to 
claim 31. - 

3Q 34. A DNA sequence coding for a polypeptide 

fragment or polypeptide analog of naturally-occurring 
erythropoietin . 



35 
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35. A DNA sequence coding for [phe 13 ]hEP0, 
Ohe 49 ]hEP0-, [Phe 145 ]hEP0, [His 7 ]hEP0, [Asn 2 

des-Pro 2 through Ile 6 ]hEP0, [des-Thr 163 through 
Arg 166 ]hEP0, or [A27-55] hEPO . 

5 

36. A DNA sequence according to claim 34 which 
is a manufactured sequence. 

37. A biologically functional circular plasmid 
10 or viral DNA vector including a DNA sequence according to 

either of claims 14, 17, 34 or 35. 

38. A procaryotic or eucaryotic host cell 
stably transformed or transfected with a DNA vector 

15 according to claim 37. 

39. A polypeptide product of the expression in 
a procaryotic or eucaryotic host cell of a DNA sequence 
according to claims 17 or 34. 

20 

. 40. A glycoprotein product having a primary 
structural conformation sufficiently duplicative of that 
of a naturally-occurring erythropoietin to allow 
possession of one or more of the biological properties 
25 thereof and having an average carbohydrate composition 
which differs from that of naturally-occurring erythro- 
poietin. 

41. A glycoprotein product having a primary 
30 structural conformation sufficiently duplicative of that 
of a naturally-occurring human erythropoietin to allow 
possession of one or more of the biological properties 
thereof and having an average carbohydrate composition 
which differs from that of naturally-occurring human 
35 erythropoietin. 
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42. Vertebrate cells which can be propagated In 
vitro continuously and which upon growth in culture are 
capable of producing in the medium of their growth in 
excess of 100 U of erythropoietin per 10 6 cells in 48 
hours as determined by radioimmunoassay. 

43. Vertebrate cells according to claim 42 
capable, of producing in excess of 500 U erythropoietin 
per 10 6 cells in 48 hours. 

44. Vertebrate cells according to claim 42 
capable of producing in excess of 1,000 U erythropoietin 
per 10 6 cells in 48 hours. 

15 45. Vertebrate cells according to claim 42 

which are mammalian or avian, cells. 

46. Vertebrate cells according to claim 45 
which are COS-1 cells or CHO cells. 



10 



20 



25 



47. A synthetic polypeptide having part or all 
of the amino acid sequence as set forth in Table V and 
having one or more of the i£ vivo or in vitro biological 
activities of naturally-occurring monkey erythropoietin. 



48. A synthetic polypeptide having part or all 
of the amino acid sequence set forth in Table VI, other 
-than a sequence of residues entirely within the sequence 
aumbered 1 through 20, and having a biological property '. 
30 of naturally-occurring human erythropoietin. 

. 49. A synthetic polypeptide having part or all 
of the secondary conformation of part or all of the amino 
acid sequence set forth in Table VI, other than a 
35 sequence "of residues entirely within the sequence num- 
bered 1 through 20, and having a biological property of 
naturally-occurring human erythropoietin. 
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50. A process for the production of a polypep- 
tide having part or all of the primary structural confor- 
mation and on« or more of the biological properties of 
naturally-occurring erythropoietin, said process compri- 

5 sing: 

growing, under suitable nutrient conditions, 
procaryotic or eucaryotic host cells transformed or 
transfected with a DNA vector according to claim 37, and 
isolating desired polypeptide products of the expression 
10 of DNA sequences in said vector. 

51. An antibody substance characterized by 
immunoreactivity with erythropoietin and with a synthetic 
polypeptide having a primary structural conformation 

15 substantially duplicative of a continuous sequence of 
amino acid residues extant in naturally-occurring 
erythropoietin except for any polypeptide comprising a 
sequence of amino acid residues entirely comphrended 
within sequence, 

20 A-P-P-R-L-I-C-D-S-R-V-L-E-R-Y-L-L-E-A-K. 

52. An antibody according to claim 51, which is 
a monoclonal antibody. 

25 53. An antibody according to claim 51, which is 

a polyclonal antibody. 

54. An antibody according to claim 51, which is 
immuno-reactive with erythropoietin and a synthetic poly- 
30 peptide having the sequence selected from the sequences: 
V-P-D-T-K-V-N-F-Y-A-W-K-R-M-E-V-G, 
K-E-A-I-S-P-P-D-A-A-S-A-A, and 

V-Y-S-N-F-L-R-G-K-L-K-L-Y-T-G-E-A-C-R-T-G-D-R. 



35 
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55. A pharmaceutical composition comprising an 
effective amount of a polypeptide according to claims 1, 
16, 39, 40 or 41 and a pharmaceutical^ acceptable 
diluent, adjuvant or carrier, 

56. A method for- providing erythropoietin 
therapy to a mammal comprising administering an effective 
amount of a polypeptide according to claims 1, 16, 39, 40 
or 41. 

57. A method according to claim 56 wherein the 
therapy comprises enhancing hematocrit levels. 

58. A purified and isolated DNA sequence as set 
15 out in Table V or VI or a fragment thereof or the comple- 
mentary strand of such a sequence or fragment. 

59. A polypeptide product of the expression of 
a DNA sequence according to claim 58 in a prqcaryotic or 

20 eucaryotic host cell. 

60. An improvement in the method for detection 
of a specific single stranded polynucleotide of unknown 
sequence in a heterogeneous cellular or viral sample 

25 including multiple single-stranded polynucleotides 
wherien : 

(a) a mixture of labelled single-stranded poly- 
nucleotide probes is prepared having uniformly varying 
sequences .of bases, each o.f said probes being potentially 

30 specifically complementary to a sequence of bases which 
is putatively unique to the polynucleotide to be 
detected, 

(b) the sample is fixed to a solid substrate; 

(c) the substrate having the sample fixed 

35 thereto is treated to diminish further binding of poly- 
nucleotides thereto except by way of hybridization to 
polynucleotides in said sample, 
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(d) the treated substrate having the sample 
fixed thereto is transitorily contacted with said mixture 
of labelled probes under conditions facilitative of 
hybridization only between totally complementary poly- 

5 nucleotides, and, 

(e) the specific polynucleotide is detected by 
monitoring for the presence of a hybridization reaction 
between it and a totally complementary probe within said 
mixture of labelled probes, as evidenced by the presence 

10 of a higher density of labelled material on the substrate 
at the locus of the specific polynucleotide in comparison 
to a background density of labelled material resulting 
from non-specific binding of labelled probes to the 
substrate, 

15 said improvement comprising using in excess of 

32 mixed probes and performance of one or more of the 
following: 

(1) employing a nylon-based paper as said solid 
substrate ; 

20 (2) treating with a protease in step (c); 

.(3) employing individual labelled probe con- 
centrations of approximately 0.025 picomoles; and 

(4) employing as one of the hybridization con- 
ditions in step (dO stringent temperatures approaching to 
25 with 4'C away from the lowest calculated Td of any of the 
probes employed. 



35 
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